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This  dissertation  discusses  research  on  signal  source  generation  schemes 
for  embedded  test  circuits.  Conventional  signal  generation  methods  are  briefly 
reviewed.  A system  application  example  using  the  embedded  signal  sources  with  a 
signal  detection  circuit,  an  on-chip  substrate  characterization  method,  is  proposed. 
Without  the  need  of  final  layout  information  or  an  assumed  substrate  model  for 
electrical  behavior  of  substrate,  this  method  can  extract  the  signal  propagation 
behavior  of  substrate  itself.  This  enables  circuit  designers  to  consider  substrate  noise 
effects  in  the  early  design  stage  for  reducing  the  design  turn-around  time.  A memory- 
based  periodic  bitstream  approach  is  presented.  Also,  a comparison  of  this  proposed 
approach  with  existing  techniques  is  done  in  the  frequency  domain.  The  memory- 
based  periodic  bitstream  approach  uses  a minimum  set  of  frequency  bins  and  requires 
a tunable  noise-shaping  filter.  However,  the  post-processing  analog  filter  design  is 
not  straightforward  and  becomes  an  on-chip  test  design  bottleneck.  Therefore,  a 


linear  feedback  shift  register  (LFSR)  approach  using  finite  impulse  response  (FIR) 
filtering  is  presented  to  extract  better  quality  signals  from  square-wave  pulses 
generated  by  simple  digital  hardware.  Since  the  FIR  filtering  reduces  the  unwanted 
harmonic  components,  a noise-shaping  filter  can  be  implemented  using  passive 
elements  without  the  necessity  of  being  frequency-tunable.  The  fabricated  sample 
measurement  shows  up  to  45dB  and  40dB  spurious-free  dynamic  range  (SFDR)  for 
single-tone  and  dual-tone  signal  source  generation,  respectively. 

The  proposed  memory-based  approach  generates  a higher  frequency 
signal  source  than  the  original  periodic  bitstream  method  by  minimizing  the  number 
of  frequency  bins,  and  it  enables  a simple  test  setup  and  quick  system  launching  by 
eliminating  the  external  initialization  requiring  additional  automatic  test  equipments 
(ATEs).  Using  the  LFSR  approach  with  FIR  filtering,  the  signal  source  generation 
system  is  simpler  than  the  memory-based  approach  by  simplifying  the  digital  signal 
processing  circuits  and  relaxing  the  analog  filtering  specifications. 
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CHAPTER  1 
INTRODUCTION 

1 . 1 Why  Embedded  Test? 

Current  system  design  trends  are  to  mix  diverse  types  of  circuits,  such  as 
random  logic,  memory  blocks,  microprocessors  or  digital  signal  processing  (DSP) 
cores,  and  analog  blocks,  in  a single  integrated  System-on-Chip  (SoC).  As  chips 
become  more  integrated,  more  advanced  and  complex  circuit  blocks  are  being  put 
together.  The  ability  to  design,  manufacture,  test  and  debug  complex  SoCs  is 
increasingly  important  for  the  electronics  industry  success  [Eis99,  Tri03] . 

However,  this  SoC  design  trend  creates  some  critical  challenges  for  test, 
specifically  analog  and  mixed-signal  manufacturing  test  [Bla03],  For  mixed-signal 
designs,  testing  is  becoming  a major  cost  factor  in  the  overall  integrated  circuit  (IC) 
manufacturing  costs.  In  the  commercial  market,  up  to  80%  of  the  test  costs  are  due 
to  the  analog  blocks  or  functions  which  typically  occupy  only  around  10%  of  the  total 
chip  area  [DufOOa,  Var98].  Therefore,  without  new  test  developments,  complex 
mixed-signal  chips  may  be  manufacturable  but  become  very  costly  to  sell  after 
production  test  without  redesign. 

Traditional  test  and  diagnosis  methods  require  painful  engineering  efforts 
to  develop  the  complex  SoC  devices  containing  mixed-signal  blocks,  increasing  the 
time  it  takes  a system  to  market.  This  is  because  the  traditional  test  method  is  a flat 
methodology  which  performs  a test  after  the  final  design  stage  on  the  entire  SoC 
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design  level  [FujOl,  ZorOO],  Even  though  the  SoC  design  is  typically  built  block  by 
block,  the  conventional  method  does  not  take  advantage  of  that  block-based 
hierarchical  design  methodology. 

In  order  to  fully  make  use  of  the  hierarchical  design  structure  of  SoCs, 
design-for-test  (DfT)  and  built-in-self-test  (BIST)  techniques  for  analog  and  mixed- 
signal  circuits  have  recently  been  introduced  for  helping  alleviate  increasing  test 
overhead  [BurOl],  With  a specialized  embedded  test  block  installed  in  the  main 
system  itself,  this  embedded  test  technique  enables  IC  designers  or  SoC  developers 
to  use  lower-performance  DfT  systems  to  provide  test  coverage  at  or  beyond  that 
expected  from  the  much  more  expensive  ATE  systems.  This  increases  test  efficiency 
by  minimizing  testing  time  and  the  cost  of  measurement  at  different  design  and 
manufacturing  steps,  and  the  increased  efficiency  contributes  to  reduced  product 
turn-around-time  (TAT). 

1.2  Review  Of  Signal  Generation  For  Embedded  Test 
There  are  two  kinds  of  non-A/D  based  signal  generation  methods  used  by 
industry,  digital  square-wave  generation  and  analog  sinewave  generation.  In  this 
section,  several  conventional  methods  for  generating  signal  sources  will  be  reviewed 
and  considered  as  candidates  as  embedded  sources  for  an  integrated  self-testing 
structure. 

1.2.1  Conventional  Signal  Generation  Methods 

Digital  square-waves  can  be  simply  generated  by  using  a ring  oscillator, 
which  is  configured  as  a cascade  of  an  odd  number  stage  inverters  with  a feedback 
connection,  as  shown  in  Fig.  1.1.  Ring  oscillators  can  be  used  to  estimate  the  average 
propagation  delay  time  by  measuring  the  oscillation  frequency.  Also,  more 
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frequently,  this  circuit  generates  a digital  square-wave  signal  with  a specific 
capacitive  load  and  inverter  size.  Although  the  square-wave  is  a good  clock  source 
for  many  digital  systems,  it  is  not  appropriate  for  the  stimulus  of  analog/mixed-signal 
embedded  test  due  to  its  uncontrolled  odd  harmonics. 


(b) 

Figure  1 . 1 Cascaded  inverter  structure;  (a)  odd  number  stage  configured  as  a ring 
oscillator  (b)  even  number  stage  configured  as  a latch,  where  n is  an 
integer  greater  than  zero. 

An  oscillation  can  be  launched  with  the  negative  feedback  loop  and  the 
gain  stage  with  180  phase  shift.  For  steady  oscillation,  two  conditions  must  be 
simultaneously  met  at  the  oscillation  frequency,  which  are 

1)  the  loop  gain  of  the  gain  stage  must  be  equal  to  one,  and 

2)  the  phase  shift  from  the  gain  stage  must  be  0 or  180  degrees  with 
negative  DC  feedback. 

The  above  conditions  are  called  Barkhausen's  criteria,  and  these  are 
necessary  but  not  sufficient  conditions  for  oscillation  [Ngu92],  In  most  cases,  to  keep 
a stable  oscillation  frequency,  a frequency-selective  block  is  included  in  the  feedback 
loop  [Raz98],  A good  example  of  the  frequency-selective  block  is  an  LC  resonator. 
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Many  RF  oscillators  use  as  few  active  devices  as  possible,  trying  to  minimize  the 
noise  associated  with  them.  This  is  why  only  one  transistor  is  incorporated  in  the 
oscillatory  circuits.  Fig.  1.2  shows  two  typical  topologies  of  LC  tank  oscillators, 
which  has  one  bipolar  transistor  with  collector  to  emitter  feedback. 


Qi 


(a)  (b) 

Figure  1.2  Typical  LC  oscillators  using  an  active  transistor;  (a)  Colpitts 
oscillator  and  (b)  Hartley  oscillator  structures. 

While  LC  oscillators  can  easily  get  high  frequency  sinusoidal  sources  up 
to  10GHz  with  moderate  quality  factor  Q [Mer95,  WuOO],  it  is  difficult  to  implement 
a high  Q inductor  at  less  than  1GHz  band  [Lon97,  Yue98],  Also,  integrated  inductors 
are  not  good  for  self-testing  circuits  due  to  their  frequency  limitations,  bulky  size, 
design  and  manufacturing  difficulties. 

Oscillation  can  be  launched  without  a resonator.  An  example  of  the 
resonatorless  oscillator  is  a ring  oscillator,  as  mentioned  earlier.  Since  the  ring 
oscillator  has  large  open  loop  gain  at  the  zero-phase  frequency,  it  can  oscillate. 
Another  type  of  resonatorless  oscillator  is  a relaxation  oscillator  which  is  shown  in 
Fig.  1.3  [Raz98].  Even  though  the  circuit  has  only  one  storage  element  (the  capacitor 
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C in  Fig.  1.3)  it  can  oscillate,  because  the  positive  feedback  loop  around  Ml  and  M2 
transistors  provides  rapid  switching  at  either  side  of  the  capacitor  C.  This  relaxation 
oscillator  scheme  is  known  to  suffer  from  its  poor  signal  quality  [Raz96], 


Figure  1.3  Resonatorless  relaxation  oscillator.  It  can  oscillate  by  using  the 
positive  feedback  loop  around  Ml  and  M2  transistors,  showing  rapid 
switching  at  either  side  of  the  capacitor  C. 

The  direct  digital  frequency  synthesis  (DDFS)  uses  digital  storage 
elements  and  a digital-to-analog  converter  (DAC)  to  generate  high  precision 
sinusoidal  signals  [BelOO,  Lee98],  A basic  structure  of  DDFS  is  shown  in  Fig.  1.4.  It 
has  three  major  parts,  which  are  an  accumulator,  data  storage,  and  a data  converter. 
The  accumulator  consists  of  an  adder  and  a register,  excluding  an  input  register.  The 
width  of  the  accumulator  determines  the  size  of  the  data  storage  elements,  typically 
read-only  memory  (ROM).  If  a N-bit  accumulator  is  provided,  the  number  of  words 
of  ROM  can  be  up  to  2N.  Also,  the  accumulator  ignores  the  overflow  bit  of  the  adder 
and  goes  back  to  its  lower  address.  Therefore,  it  can  generate  the  circulating  ROM 
addresses  which  represent  the  phase  information  of  the  generated  sinusoidal  waves. 
By  adjusting  the  input  word  value,  the  frequency  of  the  generated  signal  can  also  be 
adjusted. 
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Figure  1.4  Block  diagram  of  direct  digital  frequency  synthesizer  (DDFS) 

The  generated  signal  frequency  is  dependent  on  how  many  times  the 

original  signal  is  sampled  for  one-period  time.  Once  the  number  of  samples  is 

determined,  the  generated  frequency  can  be  changed  by  adjusting  the  offset  value  of 

the  accumulator.  The  minimum  frequency  resolution  of  the  DDFS  can  be  expressed 

as 


r _ ^Clk 
min  “ 


(1.1) 


where  N is  the  width  of  the  accumulator.  Using  this  relation,  output  signal  frequency 
can  also  be  expressed  as 


f = fmin  x (accumulator  offset)=  x (accumulator  offset) 


(1.2) 


By  using  the  offset  for  ROM  access  greater-than-one,  one  can  achieve  a 
higher  frequency  without  increasing  system  clock  frequency,  but  there  will  be  some 
loss  of  signal  quality. 
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1.2.2  Periodic  Bitstream  Method 

Using  a DA  conversion  scheme  to  generate  an  analog  signal  source  comes 
with  one  advantage,  that  is,  digital  technology  without  considering  oscillation-based 
resonance  or  oscillation,  or  some  additional  tricky  analog  issues.  Therefore,  digital 
circuits  tend  to  be  easier  to  design  and  manufacture,  and  more  reliable  to  use.  For  the 
DDFS  approach  mentioned  in  the  previous  section,  data  processing  in  the  front  end 
is  fully  digital  (ROM  address  generation  and  ROM  data  read).  These  parts  are 
straightforward  to  circuit  designers.  The  analog  blocks,  DAC  and  optional  low  pass 
filter  following  the  ROM  determine  the  signal  quality. 

Another  approach  adopting  the  digital  technology  for  analog  source 
generation  has  been  proposed  in  Dufort  and  Roberts  [DufOOa],  which  is  called  a 
periodic  bitstream  method.  It  takes  advantage  of  the  noise-shaping  characteristics  of 
delta-sigma  modulation  and  oversampling  schemes  to  generate  the  digital  data 
representing  high  precision  analog  signals.  The  fundamental  idea  is  to  modulate  a 
digital  sequence  representing  the  desired  analog  signal  into  a 1-bit  data  stream. 
Subsequently,  the  encoded  bitstream  is  replicated  driving  an  analog  noise-shaping 
filter,  which  creates  an  analog  source.  This  successive  procedure  is  shown  in  Fig.  1.5. 

In  order  to  implement  this  signal  generator  on-chip  with  little  hardware 
overhead,  the  bitstream  needs  to  be  repeated  and  reduced  to  a short  length  to 
approximate  the  original  bit  pattern.  An  optimization  process  must  choose  the  best 
length  sequence  out  of  the  infinitely  long  original  sequence. 

1.2.3  Comparison  Of  Existing  Methods  For  Signal  Generation 

Table  1.1  summarizes  the  existing  literature  on  signal  generation.  The  key 
feature  for  embedded  IC  production  test  is  area.  Designers  would  like  all  embedded 
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Figure  1.5  Block  diagram  of  delta-sigma  periodic  bitstream  method 


Table  1.1  Comparison  of  existing  methods  with  this  work 


Technology 

Frequency 

Core  area 

Estimated  0.18pm 
equivalent  area 

PLL-based  work 
[Pam04] 

0.18pm  CMOS 

2.4  ~ 2.48GHz 

2.72x2.47  (mm2) 

6.72  (mm2) 

Frequency  synthesizer 
[Tan95] 

0.8pm  CMOS 

400MHz 

5. 9x6.7  (mm2) 

8.6  (mm2) 

Frequency  synthesizer 
[Cha94] 

1.0pm  CMOS 

0~  13MHz 

2.9x4.9  (mm2) 

3.2  (mm2) 

LC  resonator 
[Cra95] 

0.7pm  CMOS 

1.76GHz 

5.18  (mm2) 

1.33  (mm2) 

Periodic  bitstream  method 
[HafOO] 

0.35pm  CMOS 

15KHz 

(with  20MHz  clock) 

3x3  (mm2) 
(with  external  filter) 

4.6  (mm2) 
(with  external  filter) 

Memory-based  approach 
(this  work  in  chapter  3 & 4) 

0.25pm  BiCMOS 

10  ~ 50MHz 

0.9x0.6  (mm2) 

0.4  (mm2) 

LFSR  approach  (FIRO) 
(this  work  in  chapter  5) 

0.25pm  BiCMOS 

10~80MHz 

0.2x0.2  (mm2) 

0.03  (mm2) 

LFSR  approach  (FIR1) 
(this  work  in  chapter  5) 

0.25pm  BiCMOS 

10  ~ 80MHz 

0.2x0.25  (mm2) 

0.04  (mm2) 
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test  circuits  to  be  the  size  of  an  I/O  pad  (~0.04mm2).  Only  the  periodic  bitstream 
method  proposed  by  Hafed  [HafOO]  and  this  work  are  designed  for  self-testing 
purposes.  Therefore,  the  other  works  described  in  Table  1 . 1 can  generate  only  single- 
tone signals,  and  are  extremely  difficult  to  extend  for  multi-tone  signal  generation. 
In  Hafed’s  periodic  bitstream  method  using  oversampling  delta-sigma  modulation 
[HafOO],  the  circuit  generates  arbitrary  signal  sources  as  well  as  single-tone  or  dual- 
tone sources  by  loading  a different  code  from  external  equipment.  However,  the 
circuit  needs  to  be  initialized  with  the  help  of  external  test  equipment,  and  since  it 
uses  long  delta-sigma  modulated  code  for  analog  signal  extraction,  its  hardware  size 
is  huge  compared  to  this  work.  The  physical  die  size  of  the  periodic  bitstream  method 
implementation  summarized  in  Table  1.1  is  calculated  with  an  external  analog  filter 
for  high  frequency  noise-shaping.  And,  the  sampling  clock  frequency  for  system 
demonstration  is  limited  by  external  equipment  setup. 

1.3  Signal  Source  Requirements  For  Embedded  Test 

Analog  circuits  are  often  driven  by  analog  sinusoidal  signals  or  "symbols" 
composed  of  pieces  of  sinewaves  while  operating  and  being  tested.  A good  frequency 
programmable  analog  sinusoidal  source  is  necessary  to  build  a signal  source  for 
embedded  test  of  an  analog/mixed-signal  circuit. 

Obviously,  testing  features  are  not  the  primary  functions  of  integrated 
circuits,  and  test  circuits  do  not  add  anything  to  IC  performance,  only  to  IC  quality. 
That  means  customers  do  not  want  to  pay  more  for  a production  part  with  increased 
testability.  But  customers  do  demand  fault-free  circuits.  This  is  why  every 
embedded-test  circuit  should  be  as  small  as  possible.  However,  state-of-the-art  high 
performance  analog  or  mixed-signal  circuits  require  high  precision  verification 
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sources.  Programmability  and  multi-tone  generation  is  also  essential  to  test  various 
items,  such  as  frequency  response  and  intermodulation  characteristics. 

An  additional  desirable  feature  is  a calibration-free  system  and 
initialization-free  structure  for  easy  and  low  cost  measurement  setup.  Some  other 
previously-published  research  on  embedded  source  generation  shows  good  results 
[DufOOb],  but  it  requires  time-consuming  initialization  processes  to  set  up  the  on- 
chip  circuits.  This  is  not  a recommendable  situation  considering  the  high  cost  of 
testing  and  test  time  for  manufacturing  test. 

Exploiting  and  recycling  the  existing  hardware  resources  for  test  in  the 
main  IC  must  be  seriously  thought  about  to  minimize  the  IC  design  area.  For 
example,  existing  DACs  or  program  memory,  such  as  EEPROM,  could  be  diverted 
to  a DDFS  approach  for  self-testing,  and  analog  low  pass  filters  employed  in  the  DUT 
are  also  good  candidates  to  do  additional  testing  work. 

Another  issue  associated  with  embedded  test  is  to  guarantee  the  testing 
circuits  against  malfunction.  Failure  in  test  circuit  operation  also  means  100%  failure 
in  test  decision  (whether  or  not  the  IC  is  good).  By  necessity,  the  testing  circuits  are 
always  presumed  to  be  failure-free.  This  is  a reason  the  testing  circuits  must  be  small 
and  simple  to  reduce  the  probable  failure  in  the  testing  circuit  itself. 

1.4  Research  Goals 

The  first  goal  of  this  research  is  to  generate  high  precision  embedded 
signal  sources  for  mixed-signal  ICs.  The  generated  sources  must  be  both  single-tone 
and  dual-tone,  and  also  are  required  to  be  frequency-programmable  for  various 


applications. 
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The  second  goal  is  to  minimize  the  design  overhead  in  incorporating  the 
proposed  embedded  source  generator  into  a mam  mixed-signal  chip,  from  both  area 
standpoint  and  system  performance  standpoint.  Many  other  different  possible 
schemes  to  build  embedded  signal  sources  were  screened  out  because  they  were  not 
practical. 

Finally,  this  work  shows  the  generic  structure  of  an  embedded  self-testing 
scheme.  And,  more  specifically,  an  efficient  substrate  characterization  methodology 
for  analog/mixed-signal  ICs  is  developed  using  embedded  signal  source  generation 
and  detection  circuits.  The  integrated  self-testing  system  for  revealing  substrate 
noise  propagation  behavior  is  reviewed  showing  practical  considerations  and 
identifying  of  potential  problems. 

1.5  Organization  Of  The  Dissertation 

This  Ph.D.  dissertation  has  been  organized  in  six  chapters.  An  overview 
of  this  research  is  given  in  the  current  chapter,  including  the  importance  of  the 
embedded  test,  the  research  goals,  and  the  scope  of  this  work.  Also,  Chapter  1 
discusses  some  existing  approaches  for  conventional  signal  generation  and  provides 
the  basic  ideas,  schematics,  applications  and  strong/weak  points. 

Chapter  2 demonstrates  some  system  level  application  examples 
employing  the  embedded  source  generator.  First,  the  generic  structure  of  embedded 
self-test  architecture  is  presented.  And,  the  application  example  for  characterizing 
the  electrical  behavior  of  the  substrate  is  discussed  using  embedded  signal  source 
generation  and  detection  circuit  blocks. 

In  Chapter  3,  the  memory-based  periodic  bitstream  method  proposed  in 
this  work  is  presented  with  its  conceptual  explanation.  Comparisons  are  made  with 
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prior  research  work  and  the  behavioral  simulation  results  executed  by  MATLAB  are 
presented.  This  chapter  also  talks  about  the  system  division  between  software  and 
hardware,  the  software  model  structure  to  emulate  and  optimize  the  whole  design, 
and  the  hardware  requirements  to  attain  high  precision  signal  sources. 

Chapter  4 discusses  circuit  design  work  to  implement  the  proposed  idea. 
The  chapter  focuses  on  the  analog  sub-circuit  description,  because  the  system 
performance  is  very  dependent  on  the  analog  signal  processing,  including  the  DA 
(Digital-to-Analog)  signal  interface  circuit  and  the  noise-shaping  filter.  The  digital 
circuit  block  contains  a program  ROM,  a shifter  register,  a variable  clock  generator, 
and  an  internal  clock  monitoring  down-counter.  This  chapter  also  discusses  IC 
integration  and  measurement  work,  and  provides  examples  of  captured  raw  data  and 
analysis  results. 

In  Chapter  5,  another  embedded  signal  generation  method  using  a LFSR 
and  a FIR  filtering  technique  is  presented.  This  chapter  includes  a new  approach 
starting  with  a square-wave  pulse,  then  FIR  filtering  methods  to  eliminate  the 
intermediate  harmonic  components  present  in  the  square-wave  signal,  followed  by 
system  performance  estimation  of  an  ideal  case  and  the  FIR  filter  coefficient 
variation  effects.  This  chapter  also  mentions  IC  design  work  including 
implementation  and  simulation,  and  the  measurement  results. 

In  Chapter  6,  the  work  presented  in  this  dissertation  and  its  results  have 
been  summarized,  and  future  work  to  overcome  the  drawbacks  of  the  current  design 


is  discussed. 


CHAPTER  2 

SYSTEM  APPLICATIONS  USING  EMBEDDED  SIGNAL  SOURCES 
2.1  Generic  Structure  for  Mixed-Signal  Embedded  Self-Test 

As  in  recent  system-on-chip  (SOC)  designs,  a large  number  of  transistors 
are  expected  to  be  integrated  onto  a single  substrate.  The  SOCs  contain  many 
different  kinds  of  sub-blocks  or  design  cores,  which  may  be  memory  cores  (e.g., 
ROM,  RAM,  or  any  other  storage  elements),  processing  cores  (CPU  or  DSP  cores), 
data  conversion  blocks  (ADCs  or  DACs),  or  analog  blocks  (operational  amplifiers  or 
analog  filters).  This  core-based  SOC  design  is  similar  to  board-level  system  design. 
The  difference  is  that  SOCs  are  built  by  integrating  individual  design  cores  which  are 
hard  to  test  or  verify  their  operation,  while  board-level  design  is  done  using  fully 
verified  packaged  ICs  or  parts. 

Although  some  of  the  sub-blocks  in  SOCs  are  directly  reachable  from 
standard  I/O  pads,  there  are  still  a lot  of  inaccessible  blocks  in  SOCs.  Owing  to  the 
development  of  built-in-self-test  (BIST)  technology  for  conventional  digital  circuits, 
such  as  scan-based  designs  (e.g.,  level-sensitive  scan  design  (LSSD),  serial  scan, 
partial  serial  scan,  or  parallel  scan  design),  integrated  digital  blocks  can  be  tested  or 
verified  with  acceptable  efforts  [Wes93,  Sha92,  GuoOl]. 

In  the  case  of  analog  or  mixed-signal  designs,  as  mentioned  in  Chapter  1, 
there  is  no  general  solution  for  self-testing.  For  digital  circuits,  input  and  output 
signals  are  defined  at  the  supply  or  ground  voltage  levels,  and  circuit  functionality  is 
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evaluated  by  the  unique  system  clock  synchronizing  the  whole  system.  It  makes  the 
circuit  verification  process  easier  with  the  use  of  automatic  test  pattern  generators 
(ATPGs).  However,  analog  signals  to  be  investigated  are  defined  continuously  in  the 
time-domain  and  the  signal  magnitude  without  allowing  any  round  numbers.  And, 
there  is  a variety  of  signal  requirements  for  analog  or  mixed-signal  block  tests, 
depending  on  the  system  specification  of  target  designs.  There  may  be  a need  for 
single-tone  sources  at  the  audible  frequency  range,  or  multi-tone  sources  at  the  IF 
band. 

Moreover,  it  is  not  easy  to  propagate  an  analog  signal  for  testing  purposes 
from  one  position  to  another  without  being  distorted  by  noise  or  attenuated  by  a lossy 
medium.  While  most  digital  signals  can  travel  anywhere  in  the  same  IC  with  proper 
buffering  and  timing  recovery,  analog  signals  need  to  be  more  careful  in  signal 
distribution.  Analog  circuit  verification  process  has  various  testing  objectives  from 
various  testing  specifications,  such  as  DC  gain  and  offset,  frequency  response,  slew 
rate  or  current  driving  capability,  input  or  output  impedance,  linearity 
characteristics,  noise  immunity,  and  so  on  [BurOl,  Nag95,  Sun99,  Agr96],  Therefore, 
all  the  test  specifications  for  analog  circuit  verification  are  difficult  to  integrate  into 
one  test  environment  or  procedure,  which  is  the  reason  analog  testing  costs  much 
more  than  the  digital  testing  does. 

To  make  the  integrated  analog  circuits  more  accessible  for  test,  additional 
dedicated  data  paths  are  required  to  drive  the  target  circuits  under  test.  Fig.  2.1  shows 
a generic  structure  for  embedded  test  of  mixed-signal  ICs.  The  analog  blocks  are 
partitioned  independently,  and  each  divided  circuit  blocks  can  be  accessible  and 
tested  individually.  The  signal  path  isolation  is  very  important  to  prevent  the 
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unwanted  signal  coupling  or  power  loss.  Especially,  radio  frequency  (RF)  circuit 
testing  needs  more  sophisticated  isolation  techniques  due  to  its  vulnerableness  to 
mismatches  and  crosstalk. 

As  shown  in  Fig.  2.1,  reliable  signal  detection  circuits  as  well  as  high 
quality  signal  sources  should  also  be  embedded  together  to  make  a reasonable  test 
decision  on  the  current  output.  The  reliability  of  mixed-signal  embedded  self-test 
depends  on  well-defined  signal  sources,  reasonable  test  result  decision  schemes,  and 
effective  isolation  techniques. 


SYSTEM  INPUT 


► Path  for  self-testing  SYSTEM  OUTPUT 

► Path  for  normal  operation 


Figure  2.1  Generic  block  diagram  of  mixed-signal  embedded  test  structure. 
(Digital  circuit  blocks  are  not  drawn  here.) 
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2.2  Substrate  Noise  Characterization 

The  advent  of  state-of-the-art  deep  submicron  complementary  metal-oxide 
semiconductor  (CMOS)  technology  has  made  it  possible  to  integrate  many  different 
digital  and  analog  blocks  onto  a single  chip,  as  in  SOCs.  This  may  dramatically 
reduce  the  system  manufacturing  cost,  physical  area,  and  power  consumption  while 
enhancing  overall  performance.  However,  one  of  the  major  challenges  in  mixed- 
signal  IC  design,  especially  SOC  design  is  the  integration  of  both  noisy  digital  blocks 
and  noise-sensitive  analog  circuits  on  the  same  substrate.  This  results  in  substrate 
noise  and  crosstalk  problems.  As  the  device  feature  sizes  decrease  and  system  clock 
frequency  for  digital  components  increases,  digital  switching  noise  increases 
remarkably,  degrading  the  analog  circuit  performance. 

2.2.1  Sources  of  Substrate  Noise 

When  the  logic  state  of  the  digital  CMOS  gates  toggles,  the  current  flows 
into  or  out  of  the  load  capacitance,  injecting  the  noise  current  into  the  substrate. 
Three  major  noise  currents  should  be  considered;  1)  capacitively  coupled  current 
through  the  load  capacitance  (Cload)  of  the  logic  gates,  2)  impact  ionization  current 
at  the  drain  end  of  a MOSFET,  and  3)  current  due  to  voltage  bounce  on  the  supply/ 
return  rails  [Don03,  AlbOl]. 

The  load  capacitance  of  digital  logic  gates  can  be  decomposed  into  the 
drain  junction  capacitance  (Cjd)  at  the  output  node,  routing  metal  capacitance 
connected  to  the  output,  and  gate-to-substrate  capacitance  (Cgs)  belonging  to  the 
inputs  of  subsequent  logic  components.  Since  the  capacitance  per  unit  area  and 
periphery  for  Cgs  is  a few  times  larger  than  Cjd  in  modern  CMOS  technology  using 
thin  gate-oxide,  the  load  capacitance  Cload  is  dominated  by  the  gate  capacitance  Cgs 
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and  routing  metal  capacitance  [NagOl].  Although  the  routing  is  one  of  the  big  design 
issues  in  current  million  transistor  integration  for  digital  ICs,  routing  metal 
capacitance  can  be  reduced  remarkably  by  limiting  the  use  of  the  lowest  metal  wires 
with  more  capacitance  for  short  interconnection  nets  with  the  availability  of  multi- 
level routing  metal  for  long  nets.  Therefore,  the  greater  part  of  capacitively  coupled 
currents  flows  directly  into  the  source  node  of  MOS  transistors  through  Cgs,  and  the 
rest  of  the  current  is  injected  into  the  substrate. 

Impact  ionization  current  is  created  by  electron/hole  pairs  exposed  to  a 
strong  electric  field  along  the  channel  of  a MOS  transistor.  The  hole  current 
generated  from  split  pairs  flows  into  the  substrate  via  the  nearest  substrate  contact 
while  the  flux  of  the  electrons  penetrates  into  the  gate  oxide.  Although  this  current 
grows  much  larger  for  smaller  gate-length  transistors,  it  is  small  compared  to 
capacitively-coupled  current  mentioned  earlier  when  switching  frequency  is  larger 
than  10MHz  as  in  sub-micron  CMOS  technologies  [BriOO]. 

When  the  digital  gates  change  their  logic  states,  a spike  current  flows 
through  the  power/ground  lines.  Due  to  the  parasitic  inductance  of  the  package  and 
bond  wires,  the  spike  current  gets  reflected  back  into  the  original  circuit,  creating 
voltage  spikes  on  the  power  and  ground  lines,  known  as  “ground  bounce”  [Fel99] . 
This  supply  noise  is  injected  into  the  substrate  via  substrate  contacts.  Once  noise  is 
present  in  the  substrate,  it  can  cause  fluctuations  in  the  bulk  node  voltage  of  the 
transistors  built  on  the  same  substrate,  as  shown  in  Fig.  2.2. 

2.2.2  Current  Researches  on  Substrate  Noise 


5.2.2. 1 Substrate  Noise  Reduction  and  Prevention 
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Analog  Vdd  Digital  Vdd 


Figure  2.2  The  substrate  interaction  model  of  the  mixed-signal  design 

To  reduce  the  substrate  noise  effects,  a variety  of  design  decisions  can  be 
made  before  the  design  data  is  taped  out  for  fabrication,  including  both  reducing 
substrate  noise  itself  and  protecting  target  sensitive  circuits  from  the  noisy  circuit 
blocks.  One  example  is  reducing  the  parasitic  package  inductance  visible  to  the 
power  supply  lines  by  using  more  expensive  packages  or  by  using  parallel  bonding 
of  the  package  pins  to  the  power  supply. 

Another  technique  to  reduce  the  substrate  noise  is  to  use  a non-EPI  process 
technology  where  the  substrate  is  more  resistive.  In  an  EPI  process,  the  substrate  is 
heavily  doped  (low  resistance)  and  then  behaves  like  a single  node  at  analog 
frequencies.  Therefore,  the  sensitive  analog  circuit  blocks  can  not  be  efficiently 
isolated  using  layout  techniques.  To  protect  against  substrate  noise  using  a non-EPI 
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process,  separate  power  supplies,  guard  rings,  or  a Kelvin  referencing  technique 
shown  in  Fig.  2.3  can  be  used  [NagOl],  However,  non-EPI  process  technologies 
increase  the  risk  of  latch-up  problems  in  densely  packed  modern  digital  ICs. 

To  protect  sensitive  analog  circuits,  they  can  be  isolated  from  the  substrate 
either  by  oxides  via  silicon-on-insulator  (SOI)  substrate  with  trenches  used  for 
device  isolation  [Col97],  or  by  PN  junctions  of  the  triple  well  technology  [Ros03], 
Although  this  may  work  properly  for  prevention  of  substrate  noise  propagation  at  low 
frequencies,  capacitive  coupling  allows  high  frequency  components  to  pass  through 
[Jor94], 


Vdd  (circuit) 


Vdd  (well  bias) 


Gnd  (substrate  bias) 


Gnd  (circuit) 


Figure  2.3  An  illustration  of  Kelvin  referencing  interconnection.  Because  the 
resistive  connections  between  the  bouncing  supply/return  rails  and 
the  well/substrate  are  eliminated,  it  can  be  effective  for  low  frequency 
noise.  However,  since  the  parasitic  capacitance  of  the  source  and 
drain  diffusion  of  MOSFETs  forms  coupling  paths,  this  scheme  does 
not  work  reasonably  at  higher  frequency. 

5. 2. 2.2  Substrate  Modeling  and  Extraction 


Another  research  direction  on  dealing  with  substrate  noise  is  to  make 
substrate  models  that  predict  fluctuation  mechanisms  and  their  propagation  to  other 
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circuit  blocks,  and  then  to  estimate  these  effects  by  simulating  the  substrate  noise 
with  other  circuits  early  in  the  design  stage.  To  simulate  substrate  noise,  an  accurate 
substrate  model  is  necessary.  These  substrate  models  vary  from  simple  lumped- 
element  models  to  complicated  electromagnetic  models  [Hei99,  Su93,  Gha96],  Fig. 
2.4  shows  an  extracted  example  of  a substrate  model  of  a twin-well  CMOS  process 
with  P-epi  layer.  In  Nagata  and  Iwata  [Nag99],  a mesh  type  resistor  model  was 
introduced  to  explain  substrate  coupling  in  analog  and  digital  mixed  large  scaled- 
integration  (LSI)  designs. 


P well 


P-  epi 


P+  bulk 


N well 


Figure  2.4  An  example  of  substrate  model  in  a twin-well  CMOS  technology  with 
P-  epi  layer.  Since  the  P+  bulk  is  heavily  doped,  its  parasitic 
resistance  can  be  ignored. 

In  addition  to  the  substrate  modeling  approach,  various  extraction 
methods  of  substrate  circuit-level  behavior  or  netlist  from  the  actual  layout  have  been 
studied.  In  Schrik,  Genderen,  and  Meijs  [Sch03],  a methodology  of  coherent 
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interconnect/substrate  modeling  at  the  chip  level  was  presented.  The  method  has 
been  implemented  using  the  layout-to-circuit  extractor  SPACE,  and  extractions 
result  in  electrical  circuit  models  which  can  be  simulated  with  HSPICE.  Although  the 
simulation  results  which  are  demonstrated  using  a 9-section  ring  oscillator  provide 
practical  insight  in  the  substrate  noise  generated  by  transistors  and  its  interconnect, 
it  is  too  hard  to  use  this  technique  for  larger  designs,  such  as  SOCs,  because  the 
circuit-level  simulation  can  not  apply  to  the  more  complicated  design  containing 
millions  of  transistors. 

Similar  extraction  approach  is  introduced  in  Haroun  [Har03],  which  uses 
process  information  (e.g.,  wafer  thickness  and  doping  profile),  circuit  information 
(e.g.,  SPICE  netlist,  noise  sources,  or  package  parasitics),  and  layout  GDSII  data  to 
generate  RC  SPICE  netlist  for  substrate  model.  Since  this  approach  extracts  final 
layout  data,  however,  its  application  area  is  also  limited  to  small-sized  designs. 

From  both  Schrik  [Sch03]  and  Haroun  [Har03],  accurate  substrate  and 
interconnect  models  or  circuit-level  netlists  can  be  found  and  simulated.  However, 
one  common  disadvantage  in  both  works  is  that  they  need  the  final  layout  data  to 
extract  substrate  and  interconnect  information  to  investigate  the  substrate  noise  and 
coupling  behavior.  If  the  design  specification  is  not  met  at  this  step,  all  the  circuit 
design  and  layout  procedures  have  to  be  repeated  until  they  pass  the  required  check 
items,  which  delays  design  turn-around-time  (TAT)  and  increases  manufacturing 
cost. 

To  examine  the  substrate  noise  characteristics  in  an  earlier  design  stage, 
simulation  should  be  carried  out  at  the  same  time  with  circuit  level  design  before 
physical  layout  work.  A methodology  was  proposed  to  accurately  evaluate  substrate 
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noise  in  cell-based  integrated  circuit  (CBIC)  design  [Cha99],  This  approach  exploits 
the  fact  that  any  given  logic  gate  injects  a particular  signal  into  the  substrate  through 
capacitive  coupling.  The  basic  idea  is  that  the  substrate  noise  of  the  entire  circuit  can 
be  evaluated  using  the  substrate  injection  patterns  of  each  basic  library  cell  and  the 
precise  switching  activity  information  in  the  internal  nodes.  The  substrate  noise  is 
computed  as  the  sum  of  the  contributions  of  all  the  gates  and  nodes  for  a specific 
input  vector  in  the  circuit  of  interest  [Mil96],  This  method  enables  the  designers  to 
investigate  the  substrate  noise  characteristics  at  the  circuit-level  design  stage  without 
the  necessity  of  completion  of  the  full  chip  layout. 

However,  the  substrate  noise  injection  depends  not  only  on  the  input 
transition  pattern  but  also  on  the  load  configuration,  which  drives  substrate  noise 
injection  pattern  of  each  of  the  cells  to  be  parameterized  with  respect  to  the  load 
capacitance.  Another  difficulty  is  that  it  takes  too  much  effort  to  apply  this  scheme 
to  non-CBIC  design  database.  Since  many  SOCs  are  composed  of  several 
heterogeneous  circuit  blocks,  such  as  DSP  cores,  memory  blocks,  glue  logic  of 
random  gates,  data  conversion  circuits,  or  pure  analog  circuits,  individual  sub-blocks 
will  probably  not  be  cell-based  design.  For  non-CBIC  blocks,  it  requires  a large 
effort  to  get  the  basic  information  about  substrate  injection  patterns. 

2.2.3  Proposed  Substrate  Modeling  Method  and  Simulation  Scheme 

Although  many  researchers  have  been  trying  to  develop  a accurate 
substrate  model  using  various  approaches,  including  a numerical  finite-difference- 
based  methods  and  quasi-analytical  approaches  [Sta94,  Sme94,  Don03],  a complete 
analysis  of  substrate  noise  requires  both  extraction  of  the  substrate  configuration  as 
well  as  simulation.  Fig.  2.5  (a)  shows  the  output  perturbation  process  from  spike 
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Figure  2.5  Analog  circuit  performance  degradation  caused  by  digital  noise  on  the 
ground  line;  (a)  noise  propagation  path,  and  (b)  its  simplified  model. 
Since  the  parasitic  effects  from  bonding  wires  and  lead  frames  in  (a) 
are  modeled  as  resistance  and  inductance,  this  path  is  assumed  to  be 
open  for  high  frequency  noise. 

current  caused  by  state  transition  of  digital  circuits.  This  signal  path  can  be 
decomposed  to  two  different  systems,  which  are  the  substrate  and  the  ground-to- 
output  responses  of  analog  circuits,  as  shown  in  Fig.  2.5  (b).  In  Fig.  2.5  (b),  the  noise 
input  represents  all  the  noise  components  present  in  the  digital  ground  line,  and  the 
noise  output  represents  the  signal  generated  by  the  transfer  function  of  cascaded 
systems,  Hsub(to)  and  Hgnd(oo). 
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The  output  signal  of  analog  circuits  implemented  on  mixed-signal  ICs 
consists  of  the  following. 

1)  Signals  processed  by  ideally-operating  circuits  with  pure  analog  input 

signals,  which  can  be  calculated  in  the  frequency-domain  as 

H(co)  x [analog  input  signal] , where  H(co)  is  the  transfer  function  derived  from  the 
input  to  the  output. 

2)  Signals  caused  by  unstable  power  supply  and  ground  lines.  Ignoring  the 
noise  on  the  power  supply  line,  this  portion  can  be  expressed  as 
Hgnd(co)  x [analog  ground  noise  signal] , where  Hgnd(co)  is  defined  as  the  transfer 
function  of  analog  circuit  itself  from  the  ground  node  to  the  output  node. 

3)  Signals  determined  by  intrinsic  noise  of  device  itself,  device  mismatch, 
offset  voltage,  and  other  unexpected  environmental  change. 

To  get  the  best  analog  system  performance,  part  1)  of  the  analog  output 
should  be  protected  against  from  part  2)  and  part  3)  mentioned  above.  Part  2)  can  be 
controlled  or  estimated  by  power  supply  rejection  ratio  (PSRR)  of  the  circuit  of 
interest,  and  part  3)  is  out-of-scope  for  consideration  in  this  section.  As  shown  in  Fig. 
2.5,  the  substrate  noise  affecting  the  analog  circuit  operation  originates  from 
Hsub(co)  as  we^  as  Hgnd(co).  Since  the  gain  reduction  of  Hgnd(co)  is  another  design 
issue,  this  work  concentrates  on  the  investigation  of  Hsub(co). 

The  goal  of  this  work  is  identifying  the  substrate  characteristics  through 
Hsub(ci>),  and  applying  the  substrate  behavior  to  circuit  simulation  by  adding  the 
substrate  noise  effect,  as  in  part  2)  mentioned  before.  The  electrical  behavior  of 
substrate  is  extremely  difficult  to  define  as  a specific  model,  because  it  varies  as  the 
following  conditions  change. 

1)  substrate  structure  (e.g.,  epi  process  vs.  non-epi  process)  and  physical 
dimension  (e.g.,  thickness) 

2)  substrate  doping  density  and  profile 

3)  the  number  of  substrate  contacts  and  its  electrical  model 
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4)  distance  from  a noise  source  to  a destination 

5)  noise  source  behavior  (e.g.,  magnitude  or  frequency) 

Therefore,  it  is  reasonable  to  start  with  the  real  measurement  data 
characterizing  the  substrate  rather  than  with  derivation  of  the  physical  behavior  of 
substrate  material.  In  other  words,  in  analog  circuit  calibration  and  performance 
estimation,  knowledge  of  the  noise  propagation  characteristics  of  the  substrate  is 
required,  not  what  the  substrate  looks  like  in  the  form  of  electrical  model.  The 
substrate  of  interest  can  be  regarded  as  a “black  box”,  and  it  may  be  analyzed  with 
well-known  specific  input  signals  to  build  a look-up  table  explaining  its  behavior. 
Then,  the  substrate  can  be  referred  as  a look-up  table,  not  an  artificially-built 
electrical  model. 

Fig.  2.6  shows  the  proposed  structure  explaining  the  substrate  noise 
measurement  and  output  signal  reconstruction  of  analog  circuits  using  an  embedded 
signal  source.  First,  noise  signal  behavior  caused  by  digital  circuits  should  be 
identified  using  transient  simulations.  Since  the  simulation  results  in  the  time- 
domain  is  not  easy  to  analyze  and  reuse  in  the  following  steps,  it  is  converted  to  the 
frequency-domain  by  the  Fourier  Transform.  Therefore,  the  outcome  of  this  step  is 
the  frequency  spectrum  behavior  of  the  digital  circuit  noise.  In  the  next  step,  the 
electrical  characteristics  of  the  substrate  where  the  digital  and  analog  circuits  are 
integrated  together  should  be  investigated  using  variable  frequency  embedded  signal 
sources,  and  the  corresponding  results  are  tabulated  to  build  a database  for 
referencing  the  substrate  noise  propagation  characteristics. 

With  the  digital  noise  spectrum  and  the  measurement  data  about  noise 
propagation  of  the  substrate,  the  noise  signal  at  the  analog  ground  node  propagated 
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Signal  Source 
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Output  signal  spectrum 
generated  by  substrate  noise 


Figure  2.6  Proposed  process  flow  for  analog  output  estimation  from  substrate 
noise.  (Shaded  steps  are  redrawn  in  Fig.  2.7  for  details.) 

from  the  digital  ground  node  via  the  intervening  substrate  can  be  calculated  or 

estimated.  Because  of  the  parasitic  inductance  and  resistance  of  bonding  wires  and 

lead  frames,  as  shown  in  Fig.  2.5  (a),  this  noise  signal  on  the  analog  ground  line  will 

directly  drive  the  main  analog  circuit  blocks.  The  detailed  procedure  to  get 

reconstructed  analog  ground  line  noise  is  shown  in  Fig.  2.7. 

To  figure  out  the  ground  noise  rejection  of  the  target  analog  circuits,  the 

transfer  function  Hgnd(co)  from  the  analog  ground  node  to  the  analog  output  of 
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reconstructed  substrate  noise 
propagated  to  the  victim  circuit 


Figure  2.7  Digital  circuit  noise  characterization  procedure  using  substrate 
measurement  data  and  sub-block  placement  information  of  full  chip 
floorplan.  Reconstructed  substrate  noise  above  can  be  combined  with 
ground-to-output  transfer  function  Hgnd(co)  of  the  victim  analog 

circuits,  as  shown  in  Fig.  2.6,  to  calculate  the  output  signal  spectrum 
of  the  victim  circuit. 
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interest  is  derived  or  simulated.  Using  Hgnd(co)  and  reconstructed  substrate  noise  at 
the  analog  ground  node,  the  output  signal  of  analog  circuits  can  be  estimated,  and  this 
result  may  be  used  for  calibration  of  analog  block  performance  or  for  design  change 
of  involved  circuits. 

2.2.4  Practical  Considerations 
5.2.4. 1 Digital  Noise  Characterization 

Most  of  mixed-signal  ICs  consist  of  large  digital  circuit  blocks  and  small 
analog  circuits,  caused  by  the  popularity  of  embedded  central  processing  unit  (CPU) 
cores  and  digital  signal  processing  techniques;  the  digital  circuit  blocks  execute  data 
processing  while  the  analog  blocks  take  over  data  conversion  or  external  interface. 
Therefore,  sensitive  analog  circuit  blocks  are  placed  on  the  mixed-signal  silicon 
wafer,  surrounded  by  huge  noisy  digital  circuits.  That  means  the  digital  noise  source 
interfering  against  analog  circuits  are  ubiquitous  in  the  whole  wafer. 

The  ground  node  noise  characterization  of  digital  circuit  can  be  done  by 
transistor-level  transient  simulation  using  user-specific  input  vectors.  However,  this 
time-domain  simulation  of  the  entire  digital  circuits  may  be  infeasible  both  in  terms 
of  simulation  time  and  hardware  requirements.  Fortunately,  hierarchical  structure  of 
huge  mixed-signal  ICs  (e.g.,  SOCs)  can  help  divide  the  integrated  digital  circuits  into 
several  sub-blocks  to  simulate  and  characterize  individually.  Also,  instead  of  using 
SPICE,  fast  circuit  simulator  which  is  optimized  only  for  power  dissipation  analysis 
(e.g.,  POWERMILL  [SynOO])  can  be  used  to  extract  the  ground  node  fluctuation 
behavior. 

To  make  a simple  digital  noise  source  model  avoiding  unnecessary 
complexity,  it  is  required  to  approximate  the  noise  sources  as  simplified  sources,  as 
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shown  in  Fig.  2.8.  In  Fig.  2.8  (a),  the  analog  ground  node  is  affected  by  several  digital 
noise  sources,  and  the  interference  effects  are  dependent  on  the  distance  from  the 
specific  noise  source  and  the  intensity  of  interfering  noise  signals.  To  determine  and 
approximate  the  distance  and  noise  component  amplitude,  transistor-level  simulation 
results  and  sub-block  placement  information  of  a full  chip  layout  are  required. 
Especially,  to  quantify  the  noise  source  amplitude  caused  by  digital  logic  blocks,  the 


substrate 


substrate 


Figure  2.8  Two-dimensional  illustration  of  digital  noise  source  approximation. 

(a)  analog  ground  node  is  interfered  by  several  digital  noise  sources, 
and  (b)  real  digital  noise  sources  can  be  replaced  by  virtual  noise 
sources. 
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amount  of  short-circuit  current  at  the  instance  the  state  is  toggling  should  be 
identified  through  the  transient  simulation. 

In  Fig.  2.8  (b),  several  digital  noise  sources  are  replaced  by  virtual  noise 
sources  which  are  implemented  as  an  embedded  signal  source  generator.  Since  the 
practical  digital  noise  signals  might  be  having  multi-tones  in  the  frequency  spectrum, 
virtual  noise  sources  may  be  needed  to  synthesize  with  several  single-tone  sources. 
It  is  extremely  difficult  to  get  a close  approximation  of  real  digital  noise  sources  from 
finite  number  of  embedded  signal  sources,  and  real  digital  noise  sources  which  are 
spread  all  over  the  whole  chip  area  can  not  be  implemented  as  equivalent  embedded 
signal  sources.  Therefore,  to  make  an  approximated  model  shown  in  Fig.  2.8  (b),  the 
followings  may  be  considered. 

(1)  Spatial  approximation:  The  closer  digital  logic  blocks  to  the  victim 
circuit  make  much  more  serious  substrate  noise  and  interference  as  shown  in  Fig.  2.9 
(a). 
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Figure  2.9  Substrate  noise  approximation  models  of  mixed-signal  ICs.  (a)  spatial 
approximation  model  in  which  closer  blocks  to  the  victim  circuit  more 
seriously  affect  the  substrate,  and  (b)  current  density  approximation 
model  in  which  more  power-consuming  blocks  make  more  fluctuation 
through  the  substrate. 
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(2)  Current  density  approximation:  The  bigger  transistors  flow  more 
transient  current  (e.g.,  short-circuit  current)  from  the  power  supply  to  the  ground.  For 
example,  clock  drivers  or  signal  buffers  generate  much  more  severe  substrate 
fluctuation  than  the  minimum-sized  random  logic  transistors  do.  This  is  illustrated  in 
Fig.  2.9  (b). 

(3)  Frequency  approximation:  Due  to  the  randomness  of  unpredictable 
digital  noise  sources,  it  is  not  easy  to  simulate  practical  digital  noise  using  pre- 
defined signal  sources.  However,  considering  many  clocked  digital  systems  are 
running  via  the  synchronized  system  clock,  a frequency-domain  approximation  can 
be  made.  That  is,  the  digital  noise  of  the  ground  substrate  is  generated  with  a 
frequency  two  times  faster  than  a system  clock  and  its  harmonics. 

5. 2.4. 2 Substrate  Characterization 


Unlike  the  previous  research  which  concentrates  on  building  a electrical 
model  using  lumped  circuit  elements  [Su93,  Gha96,  Nag99],  the  proposed  method 
starts  from  experimentally  measured  data  with  known  signal  sources  and  real 
substrate  material  to  build  a “look-up  table”.  This  look-up  table  includes  the 
following  items 

(1)  A set  of  input  signal  sources  approximating  the  digital  noise  signal 

(2)  The  corresponding  detected  signal  which  is  propagated  from  the  signal 
injection  point  via  the  substrate  material 

(3)  Measured  distance  between  the  signal  injection  and  detection  locations 

(4)  Additional  layout  structures  to  improve  the  substrate  noise  immunity 
of  analog  circuits  (for  example,  guard  rings  or  deep  trench  structures  protecting  the 
analog  blocks  from  the  digital  circuits.)  Also,  input  and  output  signals  are  classified 
according  to  the  its  amplitude  and  frequency. 

For  a signal  source  to  drive  the  substrate  node,  the  input  source  generated 
by  embedded  signal  source  generator  is  directly  injected  to  the  substrate  via  substrate 
contacts,  as  in  the  case  of  digital  transition  noise.  The  fluctuated  substrate  node 
signal  is  picked  up  with  proper  contacts  by  detection  circuits,  and  measured  to  get 
the  corresponding  information  in  the  frequency  spectrum.  Various  measurement 
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conditions,  such  as  input  magnitude,  frequency  and  the  location  of  detection  circuits, 
enrich  the  experimental  information  on  the  substrate  behavior.  Thanks  to  the 
cumulative  property  of  the  look-up  table,  it  will  have  more  abundant  information 
with  more  experimental  measurement.  Also,  it  may  be  a part  of  intellectual  properties 
(IP)  associated  with  a specific  process  technology,  because  it  characterizes  the 
substrate  behavior,  which  is  a matter  of  concern  to  many  IC  designers. 

5.2.4. 3 Substrate  Noise  Estimation  at  the  Analog  Ground  Node 

Since  the  electrical  behavior  of  the  substrate  material  under  certain 
condition  has  not  been  completely  known,  it  can  be  modeled  as  a non-linear  system. 
And,  to  make  the  analysis  simple,  it  is  also  assumed  to  be  time-invariant.  The 
substrate  noise  signal  at  the  analog  ground  node  can  be  estimated  by  looking  up  the 
measurement  database  characterizing  the  substrate  behavior.  With  the  information  of 
digital  ground  noise  signal  (such  as  frequency  spectrum)  and  block  placement  data 
of  layout,  the  substrate  noise  look-up  table  formed  from  actual  measurement  data  is 
accessed  to  find  or  approximate  the  amount  of  substrate  noise  propagated  from 
digital  ground  line  to  analog  ground  node  of  interest. 

The  accuracy  of  this  estimated  value  is  dependent  on  the  precision  of  input 
conditions.  First,  the  resolution  of  the  look-up  table  may  be  one  of  the  sources 
causing  mis-estimation  of  substrate  noise.  If  the  information  of  the  digital  circuit 
ground  noise  found  from  transient  circuit  simulation  is  not  present  in  the  substrate 
look-up  table,  the  closest  data  should  be  chosen  to  represent  the  analog  ground  signal 
fluctuations  caused  by  digital  ground  noise.  This  situation  may  happen  in  the 
frequency  or  signal  magnitude,  as  illustrated  in  Fig.  2.10. 
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simulated  digital  ground  noise 
in  the  frequency-domain 
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Figure  2.10  The  look-up  table  resolution  problem.  In  the  case  the  desired 
component  can  not  be  found,  interpolated  value  from  the  look-up 
table  may  cause  inevitable  reading  errors. 

Another  difficulty  in  using  a look-up  table  method  comes  with  the  digital 
noise  signal  having  multi-tone  components.  Since  substrate  characterization  is  done 
with  various  single-tone  sources  to  build  a measurement  database,  multi-tone  digital 
noise  will  make  the  whole  procedure  complicated  due  to  the  linearity  approximation. 
For  example,  suppose  the  digital  ground  noise  shown  in  Fig.  2.10.  The  noise  signal 
can  be  approximated  as  a dual-tone  having  component  Nj  and  N2,  ignoring  the  noise 
floor.  Given  the  function  of  y - fsub(x)  describing  the  substrate  behavior,  where  x 
is  injected  signal  to  the  substrate  and  y is  outcoming  signal  from  the  substrate  at  the 
desired  location,  it  is  known  from  the  substrate  look-up  table  that  yN  = fsub(Nj)  and 
Yn2  = fsub(N2)  when  x = N,  and  x = N2,  respectively.  If  x = (Nj+N2)  is 
injected  to  the  substrate  having  linear  operation  characteristics,  the  outcoming  signal 
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will  be  y^Ni  + n^  = fSub(Ni  + N2)  = fsub(N.)  + fsub(N2)  • However,  if  the  substrate  is 
not  linear,  the  output  is  no  longer  [fsub(Nj)  + f.ub(N2)]  • 

5.2.4. 4 Physical  Implementation 

To  build  the  substrate  noise  characterization  system,  a signal  detection 
circuit  as  well  as  a source  generation  circuit  is  also  required.  A two-dimensional 
structure  to  drive  the  substrate  with  embedded  signal  sources  and  detect  its 
outcoming  signal  is  shown  in  Fig.  2. 1 1 . To  protect  the  target  substrate  under  test  from 
other  secondary  unwanted  noisy  sources,  a “Deep  Trench”  guardring  is  required.  The 
reliable  measurement  is  guaranteed  when  both  the  signal  source  generation  and 
detection  circuits  are  working  correctly  without  any  interference  from  the 
neighboring  unstable  substrate.  The  robust  design  of  source  generation  and  detection 
circuit  against  the  substrate  fluctuation  is  also  important  to  improve  the  measurement 
system  reliability. 


Figure  2.11  A two-dimensional  structure  of  the  substrate  noise  characterization 
system.  A “Deep  Trench”  guardring  is  used  to  protect  the  substrate 
under  test  from  other  secondary  unwanted  sources  and  neighboring 
unstable  substrate. 
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The  signal  source  propagation  path,  which  is  formed  from  the  source 
generation  circuit  to  the  detection  circuit  via  the  substrate,  is  also  important  factor 
affecting  the  detected  results.  The  signal  path  should  be  short  and  free  from  the 
possible  coupling  problems.  The  substrate  noise  measurement  circuit  should  be 
located  far  away  from  the  main  circuit  blocks  to  avoid  unnecessary  interference. 

2.3  Summary 

In  this  chapter,  system  application  examples  using  embedded  signal 
sources  have  been  discussed.  First,  the  generic  structure  for  mixed-signal  embedded 
self-test  has  been  mentioned  in  section  2.1.  The  target  design  under  test  is  divided 
for  easy  access  and  analysis,  and  tested  individually.  More  specifically,  substrate 
noise  characterization  example  using  embedded  signal  source  generation  and 
detection  circuits  has  been  discussed  in  section  2.2.  This  embedded  testing  scheme 
helps  the  reliable  mixed-signal  design  by  providing  the  efficient  way  of  considering 
the  substrate  noise  propagation  during  early  circuit  design  stage  without  the 
necessity  of  completed  layout  work. 


CHAPTER  3 

EMBEDDED  SOURCE  GENERATION  USING  MEMORY-BASED  PERIODIC 

BITSTREAM 

3.1  Introduction 

In  order  to  avoid  unnecessary  calibration  and  extra  tuning  circuits,  a 
digital  design  methodology  is  required  for  embedded  signal  generation.  In  addition, 
digital  circuit  blocks  are  very  robust  and  immune  to  many  process  variations  and 
temperature  drift  effects.  This  design  philosophy  leads  one  to  the  digital-to-analog 
(DA)  conversion  approach,  not  a resonator-based  approach.  DA  conversion  schemes 
are  much  easier  to  control,  and  delta-sigma  modulation  is  characterized  by  its  high 
precision  performance,  as  shown  in  Fig.  3.1  [Rab99,  Tay98],  Therefore,  analog-to- 
digital  (AD)  modulation  is  implemented  by  using  the  delta-sigma  modulation 
method. 

Since  the  AD  modulation  process  needs  analog  signal  inputs  to  be 
modulated,  only  the  DA  demodulation  part  is  implemented  on  the  chip,  while  the  AD 
modulation  is  performed  in  a software  model.  All  the  necessary  blocks  are  on  the 
same  IC,  including  a clock  generator,  a noise-shaping  filter,  code-storage  ROM,  etc. 
No  external  programming  is  required  after  fabrication;  this  is  done  in  the  code- 
storage ROM.  A down-counter  is  added  to  monitor  the  internal  clock  speed,  which  is 
critical  to  control  the  frequency  of  the  generated  signal. 

In  this  chapter,  an  embedded  signal  source  generation  method  using 
oversampling  delta-sigma  modulation  scheme  is  presented.  Overall  behavior  of  the 
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signal  source  generation  system  is  modeled  using  MATLAB  to  simulate  and  estimate 
the  system  performance.  This  chapter  describes  the  basic  idea  of  the  proposed 
memory-based  periodic  bitstream  method  for  signal  source  generation,  and  compares 
the  major  difference  between  this  work  and  original  periodic  bitstream  approach. 
Finally,  the  design  variables  that  affect  the  final  resultant  signal  quality  and  the 
required  hardware  to  implement  the  whole  design  are  specified. 


Figure  3.1  Current  ADC  performance  comparison  (after  [Wal99]). 

3.2  Basic  Concept 

The  proposed  memory-based  periodic  bitstream  approach  is  based  on  the 
conventional  DDFS  [BelOO]  and  the  periodic  bitstream  method  [DufOOa].  For  the 
DDFS  approach,  internal  memory  components  do  not  have  to  be  loaded  from  the 
external  source,  because  they  are  already  programmed  while  being  manufactured. 
With  the  initial  phase  information  and  the  on-chip  accumulator  available,  the  storage 
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elements  are  automatically  accessed  to  generate  digital  data  sampled  from  an  ideal 
sine-wave.  The  issue  here  is  how  to  convert  this  multi-bit  digital  data  back  to  an 
analog  signal. 

For  the  periodic  bitstream  method,  its  shift  chain  registers  should  be  pre- 
loaded  with  the  modulated  data  generated  externally.  In  this  case,  we  can  make  1-bit 
modulated  data  by  using  a delta-sigma  modulation  scheme.  Multi-bit  modulation 
undoubtedly  produces  much  better  performance  but  with  increased  hardware  size  and 
much  more  difficulty  in  design  and  in  manufacture.  Also,  oversampling  techniques 
give  us  decreased  quantization  noise  within  the  bandwidth  of  interest. 

A 1-bit  modulation  structure  enables  us  to  have  a good  linearity  data 
conversion  characteristic  and  more  relaxed  design  specifications  for  the  DAC  block. 
In  other  words,  using  a low  pass  filter  (with  proper  signal  level  scaling  to  interface 
the  analog  block),  we  can  extract  desired  signal  sources  from  a 1-bit  data  stream 
without  a dedicated  DAC  block.  This  is  very  desirable,  considering  the  difficulty  in 
designing  a highly  linear  multi-bit  DAC.  By  incorporating  both  a variable  clock 
generator  and  a tunable  low  pass  filter,  the  output  frequency  of  generated  signal  can 
be  adjusted  without  loading  different  modulated  code  on  the  storage  component.  This 
relationship  is  illustrated  in  Fig.  3.2. 

Fig.  3.3  shows  the  block  diagram  structure  of  the  proposed  system;  the 
basic  operation  is  as  follows.  Software  models  the  AD  modulation  process  from  an 
ideal  source  and  the  DA  demodulation  process  realized  in  hardware  outputs  an 
analog  source.  As  shown  in  Fig.  3.3,  the  AD  process  is  assigned  to  software  and  the 
DA  operation  is  done  in  the  hardware  section.  This  configuration  requires  no  pre- 
load of  modulated  data  from  an  off-chip  instrument.  Software  models  of 
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Figure  3.2 


Constitution  of  proposed  memory-based  periodic  bitstream  method. 


Figure  3.3  Block  diagram  of  proposed  memory-based  periodic  bitstream  method. 
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oversampling  delta-sigma  modulation  can  generate  1-bit  aperiodic  bitstreams.  This 
infinitely  long  bitstream  has  a high  pass  noise  transfer  function,  so  by  doing  noise- 
shaping by  using  a low  pass  filter,  the  original  signal  is  restored  from  the  1-bit  digital 
stream.  We  must  choose  the  subset  of  the  continuous  bitstream  to  generate  a 
comparable  result  and  repeat  a chosen  bitstream.  Since  an  infinite  bitstream  cannot 
be  stored,  bitstream  optimizing  software  selects  the  best  subset  out  of  original  delta- 
sigma  modulated  data.  Several  design  factors  affect  the  final  result,  which  will  be 
described  in  section  3.4. 

The  outcome  from  the  software  model  is  stored  in  an  on-chip  memory 
element  as  done  in  a DDFS  approach,  while  the  chip  is  being  manufactured.  The 
shifter,  which  is  configured  as  a rotator,  is  loaded  with  the  contents  of  on-chip 
memory  before  rotating  its  contents.  This  loaded  data  could  represent  single-tones  or 
multiple  tones  depending  on  the  software  model.  The  fundamental  signal  frequency 
is  determined  by  the  shifter  clock  frequency  and  oversampling  ratio  (OSR).  Given  a 
variable  clock  source  to  trigger  the  shifter,  a programmable  signal  source  is 
constructed.  Furthermore,  1-bit  digital  comb  filter  eliminates  the  even  harmonic 
components  associated  in  the  digital  bitstream,  which  helps  increase  the  spurious- 
free  dynamic  range  (SFDR).  Although  dominant  even  harmonics  caused  by  a 
repetitive  digital  bitstream  can  be  removed  in  the  single-tone  case,  other  even 
harmonic  components  coming  from  the  analog  filter's  distortion  can  not  be 
eliminated.  The  noise-shaping  analog  filter  must  have  a tunable  cutoff  frequency  for 
a wider  range  of  programmability  in  the  fundamental  frequency.  This  tunable  cutoff 
frequency  can  filter  out  the  noise  near  the  fundamental  frequency. 
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3.3  Comparison  With  Original  Periodic  Bitstream  Method 

The  basic  idea  of  the  proposed  memory-based  periodic  bitstream  method 
[EisOl,  Cho03]  and  original  periodic  bitstream  method  [DufOOb]  are  the  same, 
(however,  the  frequency  range  for  this  work  is  MHz  while  the  previous  work  was 
KHz.)  They  both  use  an  oversampling  technique  to  reduce  the  quantization  noise  in 
the  band  of  interest  and  delta-sigma  modulation  to  push  the  unwanted  noise  to  a high 
frequency  band.  Though  both  methods  have  the  same  starting  point,  they  are  aiming 
at  different  target  applications. 

For  a periodic  bitstream  method  which  has  length  N and  sampling 
frequency  fs,  the  output  signal  consists  of  the  following  frequency  bins, 

f.„,  = (3D 

where  M = 0,  1,  2,...,  (N/2).  Available  frequency  bins  using  this  approach  are  shown 
in  Fig.  3.4  (a).  By  combining  these  usable  frequency  bins  selectively,  the  original 
periodic  bitstream  method  can  generate  arbitrary  waveforms  as  well  as  single-tone  or 
dual-tone  waveforms.  However,  to  make  room  for  a wide  enough  frequency  band  to 
accommodate  n frequency  bins,  the  circuit  needs  a very  long  bitstream  to 
approximate  the  original  delta-sigma  modulation  output.  Also,  since  the  OSR  is 
determined  by  the  highest  frequency  bin,  which  is  fn  in  the  example  shown  in  Fig. 
3.4  (a),  a much  higher  speed  clock  frequency  is  required. 

Suppose  the  specific  measurement  environment  requires  simple 
waveforms,  not  arbitrary  waveforms.  In  this  case,  a lot  of  frequency  bins  are  wasted 
with  the  above  approach,  because  they  are  never  used.  The  proposed  memory-based 
periodic  bitstream  method  uses  only  2 frequency  bins,  as  shown  in  Fig.  3.4  (b). 
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PSD(f) 


(a) 


(b) 

Figure  3.4  Typical  spectrums  of  (a)  original  periodic  bitstream  method  [DufOOb] 
and  (b)  this  work 

Implementation  of  the  power  spectrum  shown  in  Fig.  3.4  (b)  enables  us  to  save 
frequency  bandwidth,  shorten  the  bitstream  length,  and  increase  the  signal  frequency 
generated  using  the  same  system  clock  as  in  Fig.  3.4  (a).  Drastically  shortened 
bitstreams  could  be  programmed  in  an  embedded  memory  at  manufacturing  stage,  as 
this  work  demonstrates.  This  eliminates  the  initialization  step  of  loading  modulated 
code  before  normal  signal  generation  operation.  An  implementation  example  using 
the  power  spectrum  is  shown  in  Fig.  3.4  (a),  Hafed  and  Abaskharoun  [HafOO]  using 
1024-bit  long  bitstream  shows  about  61dB  SFDR  of  15KHz  single-tone  generation  at 
a sampling  rate  of  20MHz.  Furthermore,  in  order  to  get  improved  signal  quality,  a 
periodic  bitstream  method  using  much  longer  bitstream  and  even  multi-bit  resolution 
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(not  1-bit)  has  been  attempted  [DuffOOb],  Table  2.1  summarizes  advantages  and 
disadvantages  of  each  approach  discussed  so  far. 


Table  2.1  Comparison  between  this  work  and  Dufort  and  Roberts  [DufOOb] 


Dufort  and  Roberts  [DufOOb] 

This  work 

advantages 

► arbitrary  waveform 

► loose  noise-shaping  spec. 

► passive  filtering  for  noise- 
shaping 

► single/dual-tone  generation 

► shorter  bitstream  length 

► no  initial  code  loading 
►high  frequency  generation 
at  given  clock 

disadvan- 

tages 

► longer  bitstream  length 

► initial  code  loading 

► lower  frequency  genera- 
tion at  given  clock 

► tough  noise-shaping  spec. 

► no  arbitrary  waveform 
►tunable  filter  for  noise- 
shaping 

► variable  clock  source 

3.4  Modulation 

As  shown  in  Fig.  3.3,  the  bitstream  to  be  programmed  in  embedded 
memory  is  modulated  by  an  oversampling  delta-sigma  method,  which  generally  has 
an  infinitely  long  and  aperiodic  bitstream  output.  However,  the  whole  bitstream 
cannot  be  stored  with  finite  hardware  resources.  Dufort  and  Roberts  [Duf97]  show 
that  by  selecting  a certain  portion  of  the  output  stream  of  an  original  oversampling 
delta-sigma  modulator  and  repeating  this  chosen  output,  it  is  possible  to  get  a good 
approximation  of  the  output  coming  out  of  the  infinite  bitstream  implementation. 

There  is  a coherence  problem  in  selecting  a truncated  bitstream.  If  the 
selected  samples  represent  an  exact  one  cycle  or  its  multiple  of  any  periodic  wave, 
multiple  repetition  of  these  samples  can  produce  the  original  wave  that  wraps  around 
from  one  cycle  to  another.  Fig.  3.5  illustrates  the  coherence  problem  in  the  case  of 
sinewave.  In  Fig.  3.5  (a),  the  sampled  values  at  tO,  tl,  t2,  and  t3  could  represent  one 
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Figure  3.5  Sinewave  examples  showing  (a)  no  coherence  problem,  and  (b) 
coherence  problem. 

cycle  of  sinewave  when  they  are  repeated,  because  the  signals  starting  at  tO  and  t4 
are  in-phase.  But,  this  is  not  the  same  in  Fig.  3.5  (b),  because  tO  and  t4  are  out-of- 
phase each  other. 

3.4.1  Oversampling  Delta-Sigma  Modulation 

According  to  the  Sampling  theorem  [Opp99],  in  order  for  a band-limited 
baseband  signal  which  has  a zero  power  spectrum  for  greater  than  f^  to  be 
reconstructed  without  any  information  loss,  it  must  be  sampled  at  a rate  fs  > 2fb . This 
2fb  frequency  is  called  the  Nyquist  frequency  (fN),  and  the  Nyquist  sampling  means 
the  sampling  at  this  Nyquist  rate  fN.  If  the  sampling  frequency  fs  is  less  than  fN, 
which  is  called  “undersampling”,  the  original  information  will  be  corrupted  by  the 
frequency  aliasing  (Fig.  3.6  (a)).  In  the  case  of  Nyquist  sampling  (fs=fN),  the  original 
data  can  be  restored  with  ideal  low  pass  filtering,  as  shown  in  Fig.  3.6  (b).  Fig.  3.6 
(c)  shows  the  “oversampling”  case,  which  is  fs>fN  (typically,  fs  is  much  greater  than 
fN).  With  oversampling,  no  information  is  lost,  and  also  no  additional  information  is 
gained. 

From  Fig.  3.6  (b)  and  (c),  however,  we  can  infer  a couple  of  advantages  of 
oversampling  techniques.  First,  they  relax  the  requirements  placed  on  the  analog 
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anti-aliasing  filter.  The  design  specification  of  the  low  pass  filter  in  Fig.  3.6  (b)  is 
very  hard  to  meet,  due  to  its  extremely  narrow  transition  band.  But  the  oversampling 
technique  keeps  the  adjacent  aliasing  copies  of  baseband  information  far  from  the 
baseband  frequency.  Therefore,  unlike  as  in  the  Nyquist  sampling,  the  anti-aliasing 
filter  of  the  oversampling  case  does  not  require  a sharp  transition  band  characteristic. 
And  the  farther  location  of  aliasing  components  from  the  baseband  components 
makes  the  filter  order  decreased,  while  keeping  the  same  SNR. 

Another  advantage  associated  with  oversampling  is  extra  dynamic  range, 
which  can  be  obtained  by  spreading  the  quantization  noise  power  over  a larger 
frequency  range.  Since  the  quantization  noise  is  dependent  on  the  quantization  level 
and  is  independent  of  the  sampling  frequency  [Joh97],  it  can  be  approximated  as  a 
white  noise  in  the  frequency  domain  (Fig.  3.7  (a)).  The  shaded  area  means  the  total 


Figure  3.6  Frequency  spectrums  explaining  (a)  undersampling,  (b)  Nyquist 
sampling,  and  (c)  oversampling. 
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quantization  noise  power,  and  the  absolute  value  at  any  frequency  can  be  express  as 
follows  [Rab99], 


k = 


_A 

Jl2 


(3.2) 


where  A is  the  difference  between  two  adjacent  quantization  levels. 

Given  two  different  sampling  frequencies  fsl  and  fS2  (fsl  < fs2)  on  the  same 
baseband  signal  fb  and  the  quantization  level,  the  frequency  response  can  be 
illustrated  as  in  Fig.  3.7  (b)  and  (c).  From  the  above  relation,  we  know  that  the  faster 
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Figure  3.7  Frequency  spectrums  explaining  quantization  noise,  (a)  white  noise 
approximation  of  quantization  error,  and  the  oversampling  effects  on 
quantization  noise  of  two  different  sampling  frequencies  (b)  fsl  » fb 

(C)fs2>fsl- 
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sampling  frequency  we  have,  the  smaller  quantization  noise  we  get  at  any  specific 
frequency.  It  can  be  explained  from  the  total  noise  power  standpoint.  Because  the 
total  quantization  noise  powers  of  both  cases  are  same,  the  dynamic  range  of  Fig.  3.7 
(c)  is  larger  than  one  of  Fig.  3.7  (b). 

Oversampling  ratio  (OSR)  is  defined  to  express  the  ratio  of  the  sampling 
frequency  to  the  Nyquist  frequency, 


where  fs  is  the  sampling  frequency,  fN  is  the  Nyquist  frequency,  and  fb  is  the 
baseband  frequency.  The  maximum  SNR  can  be  expressed  using  OSR  [Joh97], 

SNRmax  = 6.02N  + 1.76  + lOlogOSR  (3.4) 

where  N is  the  number  of  bits  in  the  quantizer.  We  see  that  doubling  the  OSR 
increases  3dB/octave  in  SNR,  which  is  equivalent  to  the  0.5bits/octave. 

From  the  Eq.(2.4),  we  know  that  the  dynamic  range  could  be  improved  as 
the  sampling  frequency  or  the  number  of  bits  in  the  quantizer  is  increased.  If  we  add 
noise  shaping  techniques  using  feedback,  which  is  called  a feedback  modulator,  to 
this  oversampling  structure,  we  can  achieve  a much  higher  dynamic  range  with  the 
same  quantizer  and  the  sampling  frequency. 

Fig.  3.8  shows  a linear  model  of  general  feedback  modulator  [Rab99j. 
Input  X is  an  analog  signal,  and  output  Y is  a quantized  digital  signal.  GA(z)  and  F(z) 
represent  a forward  and  a backward  transfer  function,  respectively.  Eq  is  an  injected 
ADC  error  caused  while  quantization,  and  ED  is  a DAC  error  representing  non- 
linearity, gain  or  offset  error.  If  the  gain  of  backward  transfer  function  (F(z))  is  large, 
as  in  delta  modulators  or  interpolative  modulators,  it  amplifies  baseband  DAC  errors 
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relative  to  the  input  signal,  which  causes  implementation  difficulties.  Another 
subclass,  called  a delta-sigma  modulator,  has  unity  feedback  gain  to  avoid  this 
problem.  Its  simplified  linear  model  is  shown  in  Fig.  3.9. 

Considering  X and  Eq  as  two  independent  inputs  in  Fig.  3.9,  we  can  define 
the  signal  transfer  function  STF(z)  and  noise  transfer  function  NTF(z)  as  follows, 


STF(z) 


Y(z)=  H(z) 
X(z)  1 + H(z) 


(3.5) 


Figure  3.8  Linear  model  of  a feedback  modulator.  GA(z)  and  F(z)  represent  a 
forward  and  a backward  transfer  function,  respectively.  Eq  is  an 
injected  ADC  error  caused  while  quantization,  and  ED  is  a DAC  errors 
representing  non-linearity,  gain  or  offset  error. 
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Figure  3.9  Simplified  linear  model  of  a delta-sigma  modulator,  which  has  unity 
feedback  gain  to  avoid  excessive  amplification  of  the  baseband  DAC 
errors  in  Fig.  3.8. 
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NTF(z)  = 

Eq(z)  1 + H(z) 


That  is,  output  Y(z)  can  be  expressed, 


(3.6) 


Y(z)  = STF(z)X(z)  + NTF(z)Eq(z)=  (-  “ffiz)X(z)  + , + ‘ Eq(z))  (3.7) 

If  we  take  H(z)  such  that  its  magnitude  is  large  over  the  frequency  band  of 
interest,  we  can  remove  the  quantization  noise  Eq  in  the  baseband.  Although  the  in- 
band  noise  is  drastically  decreased  with  proper  choice  of  H(z),  high  frequency  noise 
is  not  reduced,  but  increased.  However,  low  pass  filter  following  the  modulator  stage, 
which  is  called  noise-shaping  filter,  can  remove  these  remnants.  The  noise  power  in 
baseband  obviously  decreases  as  the  order  of  NTF(z)  increases  [Joh97], 

3.4.2  Software  Implementation 

As  mentioned  in  Chapter  1,  the  periodic  bitstream  method  repeats  the  pre- 
selected set  of  data  infinitely  to  generate  a desired  signal.  Then,  the  question  is  how 
it  relates  the  number  of  bits  of  the  selected  bits  and  OSR  to  the  signal  quality 
generated. 

An  AD  modulation  is  implemented  using  MATLAB  to  estimate  the  whole 
system  performance.  Fig.  3.10  shows  the  behavioral  model  structure  of  this  proposed 
scheme.  First,  an  ideal  sinusoidal  wave  is  generated  with  desired  magnitude  and 
phase  information,  which  could  be  single-tone  or  multi-tone.  This  sinusoidal  wave  is 
sampled  in  the  time-domain,  and  its  sampled  value  is  stored  for  subsequent  delta- 
sigma  modulation.  Sampling  is  controlled  through  the  sampling  frequency  and  the 
OSR,  that  is,  these  two  factors  determine  the  signal  frequency  to  be  generated.  A 
delta-sigma  modulation  module  creates  an  infinitely  long  aperiodic  digital  bitstream 
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Figure  3.10  MATLAB  behavioral  model  of  signal  generation. 
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from  the  sampled  analog  wave.  The  finite  bitstream  to  be  programmed  in  embedded 
memory  is  chosen  out  of  this  delta-sigma  modulated  data,  depending  the  starting  bit 
location  (SBL)  search  mode.  Once  the  finite  bitstream  has  been  determined,  it  is 
repeated  to  generate  the  periodic  digital  bitstream,  which  looks  different  from  the 
original  aperiodic  delta-sigma  modulated  data.  After  digital  comb  filtering  to  reduce 
some  of  the  even  harmonic  components,  it  is  analyzed  in  the  frequency-domain 
through  a fast  Fourier  Transform  (FFT).  Generated  signal  quality  is  indicated  by 
SFDR  or  signal-to-noise  ratio  (SNR). 

Since  the  original  modulated  output  data  does  not  have  periodicity  in  its 
bit  pattern,  there  are  many  different  cases  for  sub-bitstream  selection.  Furthermore, 
the  magnitude  and  the  phase  of  an  input  source  used  for  modulation  could  change  the 
modulated  output  [Cho03].  Considering  all  factors  affecting  the  signal  quality,  it  is 
not  straightforward  to  pick  the  best  sequence  of  finite  length  of  bitstream  from  the 
original  bitstream.  This  bitstream  optimization  requires  large  computing  effort  to 
simulate  and  review  all  possible  cases  changing  the  input  variables.  Fig.  3.11  shows 
the  design  variables  of  optimization  process. 

3.4,2. 1 Amplitude  And  Phase  Of  An  Input  Sinewave 

By  varying  the  amplitude  and  the  phase  of  the  input  sinewave  in  the 
modulation  model,  different  results  are  obtained,  as  illustrated  in  Fig.  3.12.  Only 
amplitude  or  phase  difference  could  make  the  different  output  bitstream.  Fig.  3.13 
and  Fig.  3.14  show  how  the  signal  quality  is  changed  with  varying  condition  of  input 
amplitude  and  phase.  Signal  quality  is  measured  as  SFDR,  which  is  the  magnitude 
difference  between  the  fundamental  and  the  largest  harmonic  component  (typically 
the  third  harmonic  component).  Fig.  3.13  (a),  (b),  and  (c)  are  obtained  with  different 
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Figure  3.11  Design  variables  of  bitstream  optimization  for  second-order  delta- 
sigma  case. 

SBLs  and  the  number  of  output  bitstream.  Obviously,  as  the  input  amplitude  changes, 
the  SFDR  the  selected  bitstream  produces  is  also  changing  and  its  pattern  is  not 
predictable.  The  same  thing  is  done  to  see  how  the  SFDR  varies  with  varying  input 
phase.  As  in  Fig.  3.13,  Fig.  3.14  (a),  (b),  and  (c)  are  obtained  with  different  SBLs  and 
the  length  of  output  bitstream.  Similarly,  a different  value  of  input  phase  makes  a 
different  result,  which  is  also  unpredictable.  Fig.  3.13  (d)  and  Fig.  3.14  (d)  tell  us 
how  fine  resolution  is  required  to  find  the  best  SFDR. 

3.4.2, 2 OverSampling  Ratio  (OSR)  And  The  Number  Of  Selected  Bitstream 

As  described  earlier,  oversampling  techniques  drastically  reduce  the 
quantization  noise  power  in  the  baseband,  while  the  total  noise  power  remains 
unchanged.  Therefore,  SNR  could  be  much  better  with  higher  OSR,  as  seen  in  Eq. 
(2.4).  Although  SFDR  is  not  directly  related  to  SNR,  because  it  is  concerned  with  the 
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Figure  3.12  Different  results  made  from  (a)  different  amplitudes  and  (b)  different 
phases  of  input  sinewaves. 
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0SR=16,  phase=0,  ic_x  1=1.0,  ic_x2=0.98,  SBL=2571,  128  bits  & No  Comb  filtering 


(a) 


0SR=16,  phase=0,  ic_x1=1.0,  ic_x2=0.98,  SBL=238,  128  bits  & No  Comb  filtering 


Figure  3.13  MATLAB  simulation  with  varying  input  amplitude;  (a)  SFDR 
simulation  results  when  OSR  is  16,  input  phase  is  0 radian,  initial 
conditions  for  xl  and  x2  are  1.0  and  0.98,  SBL  is  2571,  128-bit  stream 
is  used  without  comb  filtering,  (b)  SFDR  simulation  results  when 
OSR  is  16,  input  phase  is  0 radian,  initial  conditions  for  xl  and  x2  are 
1.0  and  0.98,  SBL  is  238,  128-bit  stream  is  used  without  comb 
filtering,  (c)  SFDR  simulation  results  when  OSR  is  16,  input  phase  is 
0 radian,  initial  conditions  for  xl  and  x2  are  1.0  and  0.98,  SBL  is  238, 
64-bit  stream  is  used  without  comb  filtering,  (d)  SFDR  simulation 
results  with  same  condition  as  in  (c)  and  finer  resolution  (1000-point 
resolution  at  0.001  step). 
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0SR=16,  SBL=238,  phase=0,  ic_x1=1.0,  ic_x2=0.98,  64-bit  stream  & No  Comb  filtering 


(c) 


OSR=16,  SBL=238,  phase=0,  ic_x1=1.0,  ic_x2=0.98,  64-bit  stream  & No  Comb  filtering  (1000-points  resolution  at  [0.900-0.901]) 


(d) 


Figure  3.13  Continued. 
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0SR=16,  magnitude=0.9,  Ic_x1=1.0,  ic_x2=0.98,  SBL=2571,  128  bits  & No  Comb  filtering 


(a) 


OSR=16,  magnitude=0.92,  ic_x  1=1.0,  ic_x2=0.98,  SBL=238,  128  bits  & No  Comb  filtering 


(b) 

Figure  3.14  MATLAB  simulation  with  varying  input  phase;  (a)  SFDR  simulation 
results  when  OSR  is  16,  input  magnitude  is  0.9,  initial  conditions  for 
xl  and  x2  are  1.0  and  0.98,  SBL  is  2571,  128-bit  stream  is  used 
without  comb  filtering,  (b)  SFDR  simulation  results  when  OSR  is  16, 
input  magnitude  is  0.92,  initial  conditions  for  xl  and  x2  are  1.0  and 
0.98,  SBL  is  238,  128-bit  stream  is  used  without  comb  filtering,  (c) 
SFDR  simulation  results  when  OSR  is  16,  input  magnitude  is  0.92, 
initial  conditions  for  xl  and  x2  are  1.0  and  0.98,  SBL  is  238,  64-bit 
stream  is  used  without  comb  filtering,  (d)  SFDR  simulation  results 
with  same  condition  as  in  (c)  and  finer  resolution  (1000-point 
resolution  at  0.01  radian  step). 
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0SR=16,  SBL=238.  magnitude=0.92,  ic_x1=1.0,  ic_x2=0.98,  64-bit  stream  & No  Comb  filtering 
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OSR=16,  SBL=238,  magnitudes. 92,  ic_x1=1.0,  ic_x2=0.98,  64-bit  stream  & No  Comb  filtering  (1000-point  resolution  at  [3.30-3.31 


(d) 


Figure  3.14  Continued. 
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largest  harmonic  component  power,  not  the  total  noise  power,  simulation  results 
show  that  SFDR  gets  higher  as  OSR  increases.  This  relation  is  shown  in  Fig.  3.15, 
which  is  simulated  with  different  SBLs  (=2,  238,  3897).  Though  there  is  a couple  of 
counter-indicating  data  points  (A  and  B),  the  overall  result  shows  a tendency  to 
increase  with  increasing  OSR.  Similar  phenomena  like  this  SFDR  reduction  can  be 
also  seen  in  Fig.  3.13  and  Fig.  3.14. 

Given  the  fixed  speed  of  a sampling  clock  and  the  fundamental  frequency, 
higher  OSR  requires  longer  sub-bitstreams  for  the  modulation  output.  Higher  OSR 
and  longer  bitstreams  definitely  make  better  signal  quality  at  the  expense  of  extra 
chip  area. 


Figure  3.15  MATLAB  simulation  result  showing  the  relationship  between  SFDR 
and  OSR.  Despite  a couple  of  exception  points  (A  and  B),  in  general, 
higher  OSR  makes  higher  SFDR. 
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3.4.2. 3 Starting  Bit  Location  (SBL)  Of  Selected  Bitstream 

Another  important  design  factor  in  choosing  the  best  bitstream  is  the 
starting  bit  location  (SBL)  of  selected  bitstream.  This  effect  on  SFDR  is  simulated 
using  MATLAB,  and  the  result  is  shown  in  Fig.  3.16,  which  tells  how  SFDR  varies 
as  the  SBL  changes.  In  Fig.  3.16  (a),  the  amplitude  and  the  phase  of  an  input 
sinewave  are  set  to  0.9  and  0 radian,  respectively,  and  in  Fig.  3.16  (b),  they  are 
assigned  to  0.95  and  n radian,  respectively.  For  both  cases,  128  bits  are  selected  to 
generate  repetitive  bit  pattern,  and  SBL  is  swept  from  1 to  2000.  From  the  Fig.  3.16 
(a)  and  (b),  any  similarity  or  regularity  between  them  can  not  be  found,  even  though 
each  result  reveals  some  repetitive  behavior  after  SBL  is  500.  Also,  it  should  be  noted 
that  case  (a)  has  better  SFDR  than  case  (b)  does.  In  order  to  find  better  starting 
conditions  than  in  the  case  in  Fig.  3.16  (a),  however,  more  simulation  work  is  needed. 

3. 4.2.4  Initial  Values  Of  Internal  Registers  In  The  Delta-Sigma  Modulator 

Other  research  on  analog  source  generation  using  periodic  bitstream 
methods  has  mentioned  several  design  factors  which  must  be  considered  for 
bitstream  optimization,  and  these  include  input  magnitude,  phase,  and  sampling 
position  [Duf97],  As  explained  in  section  3.3,  however,  Dufort  and  Roberts  [DufOOb] 
have  different  design  specifications  than  this  work  and  makes  use  of  a much  longer 
bitstream  for  higher  OSR. 

In  addition  to  optimization  of  design  variables  mentioned  above,  it  was 
discovered  that  the  SFDR  of  the  generated  signal  can  be  affected  by  initial  conditions 
of  internal  registers  of  the  delta-sigma  modulator.  Since  a 2nd  order  modulation  was 
done,  there  can  be  two  controllable  initial  conditions.  Fig.  3.17  shows  a 3-D  graphic 
plot  explaining  the  variation  of  SFDR  versus  two  initial  conditions.  This  result  is  also 
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OSR=16,  magnitudes. 9,  phase=0,  ic_x1=1.0,  ic_x2=0.98,  128  bits  & No  Comb  filtering 
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0SR=16,  magnitudes. 95,  phase=n,  ic_x1=1.0,  ic_x2=0.98,  128  bits  & No  Comb  filtering 


(b) 

Figure  3.16  MATLAB  results  showing  SFDR  variation  vs.  SBL;  (a)  SFDR 
simulation  results  when  OSR  is  16,  input  magnitude  and  phase  are  0.9 
and  0 radian,  initial  conditions  for  xl  and  x2  are  1.0  and  0.98,  128-bit 
stream  is  used  without  comb  filtering,  (b)  SFDR  simulation  results 
when  OSR  is  16,  input  magnitude  and  phase  are  0.95  and  n radian, 
initial  conditions  for  xl  and  x2  are  1.0  and  0.98,  128-bit  stream  is 
used  without  comb  filtering. 
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variable  with  different  input  conditions,  such  as  input  magnitude  and  sampling 
position.  Since  this  behavior  looks  chaotic  and  also  unpredictable,  it  is  extremely 
difficult  to  find  regularity  in  its  appearance.  Under  the  same  design  conditions, 
higher  order  modulation  adds  more  initial  conditions  to  be  simulated,  which  means 
additional  simulation  overhead  to  find  the  optimal  sequence. 


OSR=64,  bitstream  [1500  - (1500+127)],  mag_sine=0.549,  & No  Comb  filtering 


Figure  3.17  MATLAB  result  showing  SFDR  variation  vs.  initial  conditions  of 
internal  register  of  delta-sigma  modulator. 

3. 4. 2. 5 Bitstream  Optimization 

Besides  the  six  design  variables  shown  in  Fig.  3.11,  amplitude  and  phase 
of  the  input  sinewave,  OSR,  initial  conditions  of  internal  registers,  SBL,  and  number 
of  selected  bitstreams,  one  has  to  consider  the  effect  of  changing  the  modulation 
order  on  the  generated  signal  quality.  It  is  known  that  higher  order  delta-sigma 
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modulation  reduces  noise  in  the  baseband  [Joh97].  In  other  words,  the  SNR  of  a 
higher  order  modulation  system  is  higher  than  that  of  a lower  order  modulation 
system.  But,  the  same  thing  does  not  happen  to  SFDR,  because  SFDR  is  determined 
by  dominant  harmonic  component(s),  not  the  total  noise  power.  Furthermore,  the 
memory-based  periodic  bitstream  approach  proposed  in  this  dissertation  adopts  an 
approximate  delta-sigma  modulation  by  some  post-processing  after  the  original 
delta-sigma  modulation.  This  approximate  approach  loses  some  inherent  properties 
of  original  modulation  (which  is  predictable). 

In  order  to  select  the  best  bitstream,  a lot  of  design  factors  should  be 
considered.  Since  there  are  six  design  variables  which  affects  the  output  quality, 
optimization  software  must  sweep  six  variables  to  see  their  effects.  However,  we  can 
eliminate  some  variables  using  reasonable  assumptions.  Bitstream  length  can  be 
fixed  with  a pre-defined  fundamental  frequency,  if  only  one  period  of  the  signal  is 
modulated.  Therefore,  increasing  OSR  results  in  increasing  the  bitstream  length  to 
be  sampled,  and  OSR  and  bitstream  length  become  dependent  on  each  other.  And,  to 
control  the  offset  phase  of  the  input  sinewave,  SBL  can  be  used.  Phase  information 
of  any  sinusoidal  waveform  indicates  delay  in  the  time-domain,  and  SBL  is  also 
interpreted  as  the  delay  of  sampling  data.  Given  a fixed  phase  of  an  input  sinewave, 
varying  the  sampling  location  can  be  regarded  as  changing  the  phase  information  of 
input,  which  is  illustrated  in  Fig.  3.18.  Therefore,  a bitstream  optimization  routine 
can  combine  OSR  and  the  number  of  bits,  and  ignore  the  input  phase  variable. 

One  more  thing  to  be  considered  is  the  non-linearity  characteristic  of  the 
analog  noise-shaping  filter  which  follows  the  digital  processing  part.  Also,  because 
the  analog  filter  is  vulnerable  to  mismatch  effects  as  well  as  temperature  drift  and 


63 


Figure  3.18  Illustration  of  phase  shift  by  changing  SBL.  The  input  phase  variable 
can  be  ignored  during  bitstream  optimization  process  by  varying  the 
SBL. 

process  variation,  it  is  not  easy  to  achieve  high  performance  and  good  noise 
characteristics.  It  is  difficult  to  design  and  build  high-frequency  continuous-time 
filter  circuits  with  SNR  better  than  60dB  [Joh97],  Considering  all  the  limitations  of 
the  system  parts  to  be  integrated  together,  too  high  a SFDR  in  the  front  stage  is  not 
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beneficial,  if  the  post-processing  analog  circuits  can  not  handle  the  signal  without 
degradation  during  its  operation. 

Fig.  3.19  shows  the  bitstream  optimization  procedure  used  in  this  work. 
As  mentioned  above,  OSR  and  bitstream  length  are  combined  into  one  variable,  and 
the  phase  information  of  the  input  is  replaced  by  the  SBL.  Also,  a random  search  is 
used  for  scanning  SBL  to  find  the  best  location.  Since  embedded  hardware  size  is 
dependent  on  bitstream  length,  OSR  and  bitstream  length  must  be  determined  first. 
Bitstream  length  directly  determines  on-chip  program  memory  size  and  shift  register 
length.  Once  these  variables  are  fixed,  the  MATLAB  optimizing  routine  scans  the 
SBL  at  random  with  an  arbitrary  amplitude  of  input  sinewave.  A preliminary  SBL 
chosen  in  step  2 is  used  to  find  the  temporary  input  amplitude.  Step  3 and  step  4 are 


step  1 


step  2 


step  3 


step  4 


step  5 


step  6 


Pick  OSR  & (number  of  bits) 


Pick  preliminary  SBL 


Pick  preliminary  sinewave  amplitude 


Pick  SBL 


Pick  sinewave  amplitude 


Pick  icxl  & ic  x2 


Figure  3.19  Bitstream  optimization  flow  implemented  by  using  MATLAB.  Based 
on  the  behavioral  MATLAB  model  for  signal  generation  shown  in 
Fig.  3.10,  this  optimization  routine  finds  the  best  design  parameter 
values  to  produce  the  highest  SFDR. 
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repeated  with  finer  resolution  to  pick  the  final  values.  In  step  6,  initial  conditions  are 
swept  to  choose  the  best  value  with  other  pre-determined  variables. 

3.5  Demodulation 

Demodulation  is  implemented  in  a monolithic  circuit,  and  consists  of  a 
digital  processing  block  and  an  analog  filtering  block.  The  final  stage  for  noise 
shaping  must  be  realized  by  an  analog  circuit  to  get  the  noise-suppressed  sinusoidal 
signal. 

DA  conversion  process  is  composed  of  data  storage,  data  repetition, 
digital  processing,  signal  level  scaling,  and  low  pass  filtering  circuits.  Data  storage 
is  required  to  store  the  modulated  code  which  is  generated  from  the  software  model. 
The  actual  hardware  size  of  the  digital  part  is  dependent  on  the  size  of  the  modulated 
code  to  be  programmed,  but  in  this  1-bit  approach,  the  data  storage  element  size  is 
small,  compared  with  other  circuits.  The  data  repetition  block  generates  the  actual 
periodic  bitstream  from  the  finite  pre-programmed  code,  after  being  loaded  with  it 
from  a data  storage  block.  Since  the  OSR  is  fixed,  the  data  repetition  speed 
determines  the  frequency  of  the  generated  signal.  From  the  definition  of  OSR,  we 
know  that  the  signal  frequency  is  linearly  proportional  to  the  sampling  frequency. 
Here,  the  sampling  frequency  means  the  inverse  of  the  duration  time  for  which  the 
digital  data  stays  stable,  ignoring  random  noise  and  under/overshoot.  Therefore,  we 
can  regard  the  sampling  frequency  as  a system  clock  frequency  or  operating  speed  of 
data  repetition.  The  faster  the  data  repeats,  the  higher  the  generated  output  frequency 
goes.  This  is  why  high-speed  digital  design  and  a high-speed  clock  generators  are 
required  for  the  digital  block  implementation. 
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The  digitally  regenerated  bitstream  by  a data  repetition  block  is  now 
periodic  and  repetitive.  Even  though  it  is  not  really  delta-sigma  modulated  code,  it 
approximates  delta-sigma  modulation  when  selected  by  bitstream  optimizing 
software.  Despite  a good  optimized  code  approximation,  this  approach  inevitably 
suffers  from  various  noise  sources  associated  with  other  elements  located  in  data 
flow  path.  Fig.  3.20  shows  the  situation  of  noise  superposition.  In  Fig.  3.20  (a),  the 
noise  components  are  gradually  growing  as  the  signal  travels  through  the  signal  path. 
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Figure  3.20  Illustration  of  noise  superposition  (a)  without  a comb  filter,  and  (b) 
with  a comb  filter.  Only  harmonic-based  components  are  considered 
as  noise,  and  other  non-harmonic  based  components  are  ignored.  A 
comb  filter  inserted  between  the  data  repetition  block  and  the  analog 
filter  can  reduce  the  even  harmonics  accumulated  at  the  output  of  data 
repetition,  but  the  even  harmonic  components  generated  by  the 
following  analog  circuits  can  not  be  avoided  here. 
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Comb  filters  have  been  used  extensively  in  the  DSP  area  to  reduce  even 
(or  odd)  harmonic  components.  Unlike  analog  comb  filters,  digital  comb  filters  can 
be  easily  implemented  without  suffering  from  the  parametric  circuit  difficulties,  such 
as  process  variation,  device  mismatch,  and  temperature  drift  effects.  If  a digital  comb 
filter  which  is  designed  to  eliminate  unwanted  even  harmonic  components  is  added 
to  the  final  digital  stage,  some  of  harmonics  generated  in  the  digital  circuits  are 
reduced.  This  comb  filtering  effect  is  illustrated  in  Fig.  3.20  (b).  Fig.  3.21  shows  the 
MATLAB  results  simulated  when  there  is  no  comb  filter  (top)  and  when  the  comb 
filter  is  inserted  right  before  the  analog  stage. 


Fs=1.6GHz,  OSR=16  , input  le\el=0.9,  128-bit,  selected  bit[2571 -2698],  # of  periods=1,  1-bit  quantizer,  2nd-order  AX  & No  filtering 


Fs=1.6GHz,  OSR=16  , input  level=0.9,  128-bit,  selected  bit[2571 -2698],  # of  periods=1,  1-bit  quantizer,  2nd-order  A£  & No  filterinj 


Figure  3.21  MATLAB  simulation  results  showing  comb  filtering  effect;  without  a 
comb  filter(top)  and  with  a comb  filter(bottom).  With  a comb  filter, 
the  even  harmonics  are  drastically  suppressed,  and  the  fundamental 
and  the  odd  harmonics  are  increased  by  passband  gain  of  the  comb 
filter.  Since  the  filter  gain  at  the  fundamental  and  the  odd  harmonics 
are  same,  it  has  SFDR  unchanged. 
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The  periodically  generated  bitstream  swings  from  rail  to  rail.  For  an 
analog  noise  shaping  filter  to  work  on  this  digital  signal,  it  must  be  scaled  down  to 
an  analog  operation  input  range.  Another  reason  we  need  signal  level  scaling  is  the 
linearity  characteristic  of  the  analog  filter.  The  limitation  of  linear  operation  of  any 
analog  blocks  comes  from  the  gain  compression  and  third  harmonic  generation.  As 
shown  in  Fig.  3.22  (a),  an  ideal  linear  system  has  a linear  transfer  function,  which 
displays  no  distortion  even  if  large  inputs  are  applied.  However,  practical  systems 
have  non-linear  characteristics,  and  their  output  levels  can  be  saturated  for  large 
inputs.  If  the  input  signal  magnitude  is  small  enough  not  to  produce  any  distortion,  a 
linear  approximation  of  the  system  transfer  function  can  be  applied,  as  illustrated  in 
Fig.  3.22  (b).  For  a small  range  of  input  magnitudes,  the  output  signal  approximates 
the  linear  multiple  of  the  applied  input,  ignoring  the  constant  DC  term.  As  the  input 
magnitude  increases  further,  however,  the  output  begins  to  show  some  serious 
distortion  in  its  shape  shown  in  Fig.  3.22  (c).  Therefore,  by  scaling  down  the  digitally 


Figure  3.22  (a)  An  ideal  linear  transfer  function,  and  the  real  behavior  of  an 

amplifier  (b)  small  signal  input  applied  and  (c)  large  signal  input 
applied.  In  (a),  the  output  signal  is  linearly  dependent  on  the  input,  no 
matter  how  large  the  input  amplitude  is.  However,  as  shown  in  (b)  and 
(c),  practical  linear  systems  can  be  approximated  as  linear  only  if  the 
input  amplitude  is  small. 


69 


generated  signal,  we  can  get  an  improved  analog  filter  characteristic  to  generate  a 
better  signal  source.  However,  if  we  make  the  input  too  small,  we  will  lose  signal 
dynamic  range  because  the  smaller  signal  is  getting  closer  to  the  noise  floor  level. 
3.5.1  Required  Digital  Blocks 

For  digital  signal  processing,  as  shown  in  Fig.  3.3,  an  on-chip  memory,  a 
shift  register,  a clock  generator,  comb  filtering  circuits,  and  an  internal  clock 
monitoring  down-counter  are  necessary.  Since  we  are  interested  in  single-tone  and 
dual-tone  signals,  we  need  only  two  different  codes  modulated  for  each  case. 
Therefore,  on-chip  memory  will  not  need  to  be  reprogrammable  after  fabrication, 
even  if  different  signal  frequencies  are  necessary.  ROM  is  a good  candidate  given  its 
small  size.  Data  repetition  speed  is  very  critical  to  produce  a high  frequency  signal, 
and  is  determined  by  the  shift  register  speed.  Also,  this  shift  register  which  is 
configured  as  a rotator  must  have  a parallel  load  capability  to  load  ROM  data.  A 
variable  clock  generator  is  also  required  to  change  the  data  repetition  speed,  which 
determines  the  frequency  of  the  generated  signal.  An  internal  clock  which  is 
generated  by  a variable  clock  generator  must  be  very  high  speed  in  order  to  realize 
the  oversampling  technique.  This  internal  clock  is  directly  related  to  the  output  signal 
source  frequency,  as  expressed  in  the  definition  of  OSR.  Internal  clock  monitoring  is 
necessary  for  debugging  the  design,  so  the  clock  speed  is  scaled  down  using  a down- 
counter  block  to  interface  for  external  monitoring. 

To  implement  a digital  comb  filtering  circuit,  the  OSR  should  be 
determined  first.  For  example,  given  an  OSR  which  is  16,  a single-tone  fundamental 
signal  is  located  at  n/16  of  the  digital  frequency  domain.  For  this  specific  example, 
a 16th  order  comb  filter  is  necessary  to  remove  the  even  harmonic  components 
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located  at  2(ti/16),  4(tt/16),  6(tt/16),  8(tt/16 ),  10(tt/16),  12(tt/16),  14(tx/16), 
and  16(ti/16).  This  is  graphically  explained  in  Fig.  3.23. 


Figure  3.23  16th  order  comb  filtering  effect  on  single-tone  signal  generation.  The 
even  harmonic  components  are  cancelled  out  by  zeros  of  the  comb 
filter  transfer  function,  while  the  fundamental  tone  and  the  odd 
harmonic  components  are  remained  with  same  magnitude  change  due 
to  the  comb  filter  gain. 

The  transfer  function  of  this  16th  order  digital  comb  filter  can  be  expressed  as 

H(z)  = 1 - z-16  (3.8) 

In  the  time-domain  point  of  view,  this  means  a subtraction  between  the  current 
sample  value  and  the  16-sample  delayed  value,  which  is 

y(n)  = x(n)-x(n-  16)  (3.9) 

To  implement  this  operation,  a sample  delayed  element  is  required,  as  shown  in  Fig. 
3.24.  If  the  input  x(n)  is  completely  known  and  predictable,  the  comb  filter  may  be 
realized  without  a sample  delay  element.  In  this  memory-based  periodic  bitstream 
approach,  the  digital  data  bitstream  is  generated  by  replicating  the  finite  length  of  a 
pre-determined  bit  pattern.  Therefore,  by  tapping  the  shift  chain  in  the  middle,  a 
sample  delay  element  is  no  longer  necessary,  as  shown  in  Fig.  3.25. 

However,  dual-tone  signal  generation  has  some  problems  in  using  this 
comb  filtering  idea  to  reduce  the  harmonic  components  close  to  the  fundamental 
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Figure  3.24  Typical  implementation  of  a 16th  order  digital  comb  filter  using 
sample  delay.  To  implement  sample  delay  elements,  a shift  register 
chain  is  required. 


Figure  3.25  16th  order  comb  filter  implementation  without  delay  elements.  By 
tapping  2 different  positions  from  the  shift  register,  Eq.(2.9)  can  be 
calculated  without  sample  delay  elements. 

signal.  Fig.  3.26  (a)  shows  the  case  when  the  dual-tone  signals  are  generated  at  the 

frequency  of  (n/32)  and  2(n/32)  using  16th  order  comb  filter.  The  third  harmonic 

component  is  still  alive  just  (tt/32)  away  from  the  second  fundamental  component 

at  2( 7t/32 ).  Also,  filter  gain  is  not  the  same  at  both  ( 7t/32  ) and  2( rc/32 ).  To  get  the 

same  magnitude  of  these  two  tone  signals,  the  first  fundamental  signal  should  be 

larger  than  the  second  fundamental  signal  by  the  gain  difference.  Instead  of  a 16th 

order  comb  filtering  as  shown  in  Fig.  3.26  (a),  a 64th  order  comb  filtering  may  be 

used  to  reduce  the  sub-harmonic  components  located  at  (7t/64),  3(n/64),  5(n/64), 
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Figure  3.26  Several  digital  filtering  for  dual-tone  signal  generation  using  periodic 
bitstream  method,  (a)  frequency  spectrum  when  a 16th  order  comb 
filter  is  used.  Each  dual  fundamental  tones  have  different  gain 
magnitudes,  and  the  3rd  harmonic  component  can  not  be  removed,  (b) 
frequency  spectrum  when  a 64th  order  comb  filter  is  used.  Although 
dual  tones  have  identical  comb  filter  gain  magnitude,  the  3rd 
harmonic  is  still  remained  with  same  gain  magnitude  as  the 
fundamental  tones,  (c)  frequency  spectrum  which  eliminates  the  3rd 
harmonics,  maintaining  same  gain  magnitude  for  both  dual  tones. 
However,  it  requires  complicated  FIR  filtering  instead  simple  comb 
filtering. 
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and  so  on.  Since  the  frequency  response  is  shifted  by  (7t/64)  as  shown  in  Fig.  3.26 
(b),  the  transfer  function  of  this  filtering  is 

H(z)  = 1 + z-64  (3.10) 

The  only  differences  in  the  time-domain  operation  are  the  filter  order 
represented  by  the  sample  delay  amount  and  the  arithmetic  operation.  However,  the 
3rd-order  harmonic  component  can  not  be  eliminated  with  this  filtering.  Fig.  3.26  (c) 
shows  another  possible  method,  which  can  get  rid  of  the  third  harmonic  term 
maintaining  the  identical  dual-tone  magnitudes.  Removing  the  dominant  third 
harmonic  component  with  this  scheme  makes  it  much  easier  to  do  noise-shaping  near 
the  fundamental  frequency  range.  The  problem  is  that  the  filtering  transfer  function 
shown  in  Fig.  3.26  (c)  can  not  be  implemented  using  a simple  comb  filter,  but 
requires  a more  complicated  FIR  filter.  However,  this  approach  deserves  more 
research  in  the  future  for  high  precision  signal  generation. 

3.5.2  Required  Analog  Blocks 

In  this  embedded  test  design  philosophy,  much  effort  has  been  expended 
to  avoid  analog  circuit  blocks.  But,  a digital  filter  can  not  be  used  for  high  frequency 
noise  shaping,  because  it  needs  analog  anti-aliasing  filtering  to  make  the  digital  filter 
input  band-limited.  Therefore,  a noise-shaping  filter  should  be  implemented  as  a 
continuous-time  analog  filter.  Another  required  property  of  a noise-shaping  filter  in 
this  system  is  the  tunability  of  its  cutoff  frequency.  The  generated  signal  has  an 
adjustable  frequency,  and  unwanted  harmonic  components  are  near  the  fundamental 
frequency.  A sharp  transition  band  for  the  noise  shaping  filter  is  essential  to  isolate 
the  fundamental  signal  from  the  unwanted  harmonics.  The  variable  clock  speed 
changes  not  only  the  fundamental  signal  frequency,  but  also  the  unwanted  harmonic 


74 


locations  on  the  frequency  axis.  That  means  new  locations  for  the  unwanted 
harmonic  components  might  be  included  in  the  passband  range  of  noise  shaping 
filter,  as  shown  in  Fig.  3.27  (a).  Large  gain  in  the  filter  passband  for  unwanted 
harmonic  components  deteriorates  the  generated  signal  quality,  especially,  SFDR. 
Therefore,  as  shown  in  Fig.  3.27  (b),  the  filter  characteristic  must  be  adjusted  to  push 
the  undesired  harmonics  out  into  the  stopband.  A tunable  filter  design  with  good 
linearity  is  not  straightforward,  and  this  is  a design  bottleneck  to  get  a high  precision 
signal  source. 

3.6  Summary 

In  this  chapter,  embedded  signal  source  generation  method  using  a 
memory-based  periodic  bitstream  has  been  described.  The  proposed  method  is  based 


(a)  (b) 


Figure  3.27  Necessity  of  tunable  cutoff  frequency  for  efficient  noise  shaping,  (a) 
By  varying  the  system  clock  frequency,  the  locations  of  fundamental 
and  its  harmonic  components  are  also  shifted.  New  harmonic 
components  may  be  located  in  the  filter  passband,  degrading  the 
signal  quality  (SFDR).  (b)  To  avoid  signal  degradation  described  in 
(a),  filter  cutoff  frequency  needs  to  be  adjustable.  Tunable  cutoff 
frequency  can  efficiently  suppress  the  unwanted  harmonic 
components. 
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on  the  delta-sigma  modulation  and  demodulation  process  to  get  high  quality  signal 
sources.  To  simplify  the  hardware  resources,  the  modulation  part  is  implemented 
using  MATLAB  to  obtain  modulated  code,  and  the  implemented  software  model 
examines  modulation  process  with  various  design  variables.  A MATLAB  behavioral 
model  representing  the  signal  generation  procedure  identifies  the  optimal  input 
conditions  to  maximize  SFDR.  The  demodulation  process  is  completed  on 
monolithic  integrated  circuits,  which  include  a memory  block,  a variable  clock 
generator,  a shift  chain  register,  and  a noise-shaping  analog  filter. 


CHAPTER  4 

IMPLEMENTATION  OF  EMBEDDED  SOURCE  GENERATOR  USING 
MEMORY-BASED  PERIODIC  BITSTREAM 

4.1  Introduction 

In  this  chapter,  circuit  design  and  implementation  of  the  embedded  source 
generation  method  using  a memory-based  periodic  bitstream  mentioned  in  Chapter  3 
is  discussed.  The  demodulation  process  to  restore  the  analog  signal  sources  from  the 
delta-sigma  modulated  codes  consists  of  two  major  parts,  which  are  a front-end 
digital  signal  processing  part  and  a post  analog  filtering  part  for  high  frequency 
noise-shaping.  The  digital  signal  processing  is  done  by  a data  storage  component 
(ROM),  a data  repeater,  a variable  clock  generator,  and  an  internal  clock  monitoring 
down  counter.  And  in  order  to  improve  the  signal  quality  by  reducing  the  undesired 
harmonic  components,  a digital  comb  filtering  circuit  is  also  designed  and  placed 
before  the  analog  filtering  circuit.  The  noise-shaping  analog  circuit  is  implemented 
using  a 4th  order  differential  GmC  filter  and  signal  scaling  circuits.  Finally, 
individual  circuit  blocks  are  integrated  into  a monolithic  IC  to  measure  and  analyze 
the  quality  of  signal  source  generator. 

The  generated  signal  quality  is  determined  by  the  type  of  analog  post- 
processing, because  efficient  noise-shaping  is  directly  related  to  signal  SFDR.  And, 
since  analog  circuit  functionality  is  easily  disturbed  by  digital  switching  noise,  a 
reliable  protection  scheme  for  sensitive  analog  circuit  blocks  from  the  noisy  digital 
circuit  is  also  important. 


76 


77 


4.2  Digital  Parts 

4.2.1  Data  Storage  Component:  ROM 

Software-generated  bitstream  code  is  stored  in  an  embedded  memory 
component.  Since  the  memory-based  approach  does  not  require  a re-programming 
capability,  ROM  is  a good  choice  for  its  small  size.  ROM  is  implemented  as  an  OR- 
type  structure  shown  in  Fig.  4.1(a),  in  which  the  presence  of  a NMOS  transistor  in 
the  pull-down  programs  a bit  value  ‘O’.  A small  PMOS  transistor  is  used  as  a pull- 
up,  and  its  gate  is  connected  to  ground  to  achieve  a fast  rise  time.  Only  one  of  four 
word  lines  may  be  active  during  the  ROM  read  time,  and  all  the  word  lines  are 
normally  inactive  except  the  ROM  read  line  to  avoid  unnecessary  power 
consumption. 


code2  O- 


code3  O- 


code4  O- 


(a) 


Figure  4.1  Transistor  level  structure  for  (a)  NOR-type  ROM  structure  (whole 
circuit  not  shown  here)  and  (b)  a positive-edge  triggered  flip-flop. 
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4.2.2  Data  Repeater:  Shifter  Register 

A data  repeater  is  implemented  as  a shifter  register  with  a feedback 
connection.  This  shifter  register  is  capable  of  being  loaded  in  parallel  with  ROM  data 
on  the  data  load  interval,  and  also  has  a “set”  and  a “reset”  function.  An  individual 
register  cell  is  realized  using  positive-edge  triggered  flip-flops,  shown  in  Fig.  4.1(b). 

4.2.3  Variable  Clock  Generator 

By  varying  the  timing  circuit  (ring  oscillator)  propagation  delay,  the  clock 
frequency  is  made  variable.  There  are  two  distinct  ways  of  having  controlled 
propagation  delay,  which  are  the  current-starved  approach  and  the  shunt  capacitor 
approach  [Joh88].  In  the  current-starved  inverter  shown  in  Fig.  4.2(a),  the  control 
input  Vctri  modulates  the  “ON”  resistance  of  pull-down  M4  and  pull-up  Ml  through 
current  mirroring.  These  variable  resistances  control  the  operating  current  to  charge 
or  discharge  the  load  capacitance.  A large  Vctri  allows  a large  current  to  flow  in  the 
inverter,  producing  a small  resistance  and  a small  inverter  propagation  delay. 

In  the  shunt  capacitor  inverter  shown  in  Fig.  4.2(b),  the  control  voltage 
Vctrl  adjusts  the  resistance  of  a shunt  transistor  M3,  which  connects  a load  capacitor 
C to  the  output  of  a inverter.  The  shunt  transistor  M3  controls  the  effective  load 
capacitance  seen  by  the  “Vout”  node,  which  in  turn  adjusts  the  inverter  propagation 
delay.  A large  Vctri  decreases  the  resistance  of  the  shunt  transistor  M3  and  the 
effective  load  capacitance  at  the  output  node  becomes  large,  producing  a large  delay. 

As  discussed  in  Johnson  and  Hudson  [Joh88],  a current-starved  topology 
has  a less  linear  characteristic  of  delay  time  versus  control  voltage  and  poor  noise 
rejection.  Therefore,  the  shunt  capacitor  delay  circuit  has  been  chosen  despite  the 
large  area  due  to  the  lumped  capacitor  C. 
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Figure  4.2  Two  different  ways  of  making  circuit  delay  controllable  using  (a)  a 
current-starved  inverter  and  (b)  a shunt  capacitor  inverter.  In  (a),  the 
input  voltage  Vctri  controls  the  operating  current  to  charge  or 
discharge  the  load  capacitance  (not  shown  here)  on  the  output  node 
(“Vout”).  1°  (b),  the  control  voltage  Vctr|  determines  the  resistance  of 
a shunt  transistor  M3,  which  controls  the  effective  load  capacitance  C 
seen  by  the  “Vout”  node. 


Fully  differential  inverters  are  commonly  used  to  obtain  better  power- 
supply  insensitivity  in  ring  oscillator  designs.  A fully  differential  structure  requires 
an  even  number  of  inverters  with  interchanged  outputs  at  the  last  stage  before  feeding 
them  back  to  the  inputs.  Fig.  4.3  shows  an  2-stage  ring  oscillator  with  XOR 
frequency  doubler.  Each  of  the  differential  inverters  consist  of  a cross-coupled 
PMOS  load  pair  and  voltage  controlled  shunt  capacitors.  The  attached  XOR  gate 
doubles  the  oscillation  frequency  generated  in  the  feedback  loop.  In  this  structure, 
each  adjacent  node  is  90  degree  out  of  phase.  Fig.  4.4  shows  the  simulation  result  of 
output  clock  frequency  with  varying  control  voltage  (Vctri)  using  Cadence  spectreS. 

As  discussed  in  Chapter  1,  a ring  oscillator  using  simple  digital  inverters 
can  generate  square  wave  pulses  which  may  be  used  as  a clock.  Combined  with  some 
shunt  capacitors,  this  ring  oscillator  can  produce  a variable  clock  source,  as  shown 


80 


Figure  4.3  2-stage  differential  ring  oscillator  with  XOR  frequency  doubler, 
where  (W/L)M1  = (19.2um/0.34um),  (W/L)M2  = (6.4um/0.4um),  (W/ 
L)M3  = (12.8um/0.4um),  and  C = 500fF. 

in  Fig.  4.5.  The  output  node  in  Fig.  4.5  does  not  have  a shunt  capacitor,  because  this 
node  already  has  a load  capacitance  comparable  to  the  controlled  shunt  capacitor. 
The  simulation  result  using  SpectreS  shown  in  Fig.  4.6  is  more  linear  than  the  result 
in  Fig.  4.4. 

4.2.4  Clock  Monitoring  Component:  Down  Counter 

Since  an  internal  clock  directly  determines  the  generated  signal  frequency 
as  a function  of  the  oversampling  ratio,  the  frequency  tuning  of  the  signal  is 
controlled  by  changing  the  internal  clock  speed.  That  is  the  reason  why  we  have  to 
monitor  the  internally  generated  clock.  A very  high  speed  clock  is  not  easily 
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Figure  4.4  Cadence  SpectreS  simulation  result  of  a 2-stage  ring  oscillator  using 
cross-coupled  PMOS  differential  inverters.  Due  to  the  voltage- 
controlled  load  capacitors  on  the  drain  nodes  of  the  PMOS  transistors, 
the  generated  clock  speed  becomes  slower  in  a nonlinear  fashion  with 
increasing  control  voltage  Vent  ( >1). 


Figure  4.5  5-stage  ring  oscillator  with  voltage-controlled  shunt  capacitors, 
where  (W/L)M1  = (19.2um/0.34um),  (W/L)M2  = (6.4um/0.4um),  (W/ 
L)M3  = (12.8um/0.4um),  and  C = 500fF. 
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Figure  4.6  Cadence  SpectreS  simulation  result  of  a 5-stage  ring  oscillator,  which 
shows  more  linear  and  monotonic  response  than  the  result  shown  in 
Fig.  4.5. 

measured  at  the  I/O  pad,  and  must  be  down  converted  to  a more  easily  characterized 
frequency.  In  this  design,  we  use  10-bit  down-counter  to  scale  down  the  clock 
frequency  by  1024. 

4.2.5  Digital  Comb  Filtering  Block 

General  digital  comb  filtering  requires  sample  delay  and  arithmetic 
operation  (addition  or  subtraction)  on  the  digital  data  to  be  processed.  This  procedure 
becomes  harder  as  the  data  width  increases,  because  the  required  hardware  size 
grows  with  increased  register  and  the  wider  multi-bit  adder  structures. 

In  the  proposed  architecture,  however,  1-bit  modulation  is  employed  to 
improve  the  linearity  characteristic  of  the  subsequent  DA  conversion  process  and  to 
simplify  the  circuit  integration.  Therefore,  the  1-bit  subtraction  of  a delayed  input 
from  the  current  input  can  be  implemented  as  shown  in  Fig.  4.7.  Its  truth  table  is 


summarized  in  Table  3.1. 
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Figure  4.7  Single-bit  subtraction  for  digital  comb  filtering,  “in”  is  the  current 
input,  and  “inD”  is  the  delayed  input  data.  3-level  output  data  is 
thermometer-encoded  using  2-bit  output,  “Tl”  and  “T2”,  as  shown  in 
Table  3.1. 


Table  3.1  Truth  table  of  Fig.  4.7 


in 

inD 

(in-inD) 

Tl 

T2 

0 

0 

0 

1 

0 

0 

1 

-1 

0 

0 

1 

0 

1 

1 

1 

1 

1 

0 

1 

0 

4,3  Analog  Parts 

4.3.1  Noise  Shaping  Filter 

Filters  can  be  classified  as  active  filters,  passive  filters,  and  distributed 
filters  according  to  the  constituent  elements.  In  the  1970’s,  active  filters  make  use  of 
operational  amplifiers  together  with  capacitors  and  resistors,  and  their  performance 
is  dependent  on  the  behavior  of  the  operational  amplifier.  Passive  filters  are 
frequently  composed  of  resistors,  capacitors,  and  inductors.  If  the  sensitivity  is  an 
important  design  factor,  passive  implementation  has  an  advantage.  If  filters  are 
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required  to  operate  in  a very  high  frequency  millimeter-wave  range,  only  distributed 
filters  work,  such  as  waveguide  or  coaxial  cable  filters  [Val82], 

There  are  three  main  designs  for  analog  integrated  active  filters,  which  are 
switched-capacitor  (SC)  filters,  MOSFET-C  filters,  and  Gm-C  filters.  SC  filters  use 
resistors  replaced  by  switched  capacitors,  and  their  time  constants  depend  on  the 
ratio  of  capacitors  and  the  clock  frequency.  Unlike  an  RC  time  constant,  the  ratio  of 
integrated  capacitors  can  be  set  quite  precisely,  on  the  order  of  0.1  percent  [Gra93], 
Fortunately,  the  clock  frequency  is  also  a well  defined  quantity  in  modern 
technology.  Since  continuous  data  sampling  causes  aliasing  problems,  SC  filters 
need  anti-aliasing  filters  and  extra  clock  circuits  which  are  not  required  for  other 
types  of  analog  filters.  And,  as  operating  frequency  increases,  the  limitations  of  the 
sampling  operation  using  MOSFETs  and  capacitors  become  apparent.  At  higher 
frequencies,  the  capacitors  must  be  charged  faster.  Hence,  the  transistors  should  be 
larger,  which  increases  the  parasitic  capacitance  causing  switching  noise  and  clock 
feedthrough. 

Another  type  of  filters  uses  MOSFETs  instead  of  passive  resistors;  this  is 
called  a “MOSFET-C  filter”  [Joh97].  The  classical  R-C  integrator  is  replaced  using 
this  structure.  A MOSFET  in  the  triode  region  can  be  modeled  as  a resistor.  Although 
the  transistor  resistance  can  be  tunable  by  adjusting  the  gate  voltage,  this  integrator 
has  some  drawbacks  when  combining  integrators  to  form  a filter.  The  operational 
amplifier  of  one  integrator  has  to  drive  the  MOSFET  resistors  present  in  the  next 
stage.  At  high  frequency,  it  is  not  easy  to  drive  small  resistors,  a big  amplifier  is 
needed.  Also,  modern  short-channel  MOSFET  technology  shows  some  non-ideal 
behavior  in  the  current-voltage  relationship  derived  from  long-channel  MOSFET 
devices,  which  can  result  in  signal  distortion. 
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Another  technique  developed  for  high  frequency  applications  is  the  “Gm- 
C”-based  circuit  [SchOl,  Kar92],  Unlike  switched-capacitor  circuits,  Gm-C  circuits 
work  in  the  continuous  time-domain,  and  do  not  require  pre-sampling.  Hence,  an 
anti-aliasing  filter  and  additional  clock  circuitry  are  not  necessary,  which  brings  a 
significant  speed  advantage.  Although  Gm-C  circuits  show  relatively  poor  linearity, 
poor  noise  performance,  and  require  some  tuning  circuitry,  the  Gm-C  filter  is  a good 
choice  for  high  frequency  analog  applications. 

3. 3.1.1  Design  Of  A Basic  Transconductor 

An  ideal  transconductor  has  infinite  input  and  output  impedance,  and  also 
its  transconductance  value  is  well-known  or  controllable,  while  an  operational- 
transconductance-amplifier  (OTA)  does  not  necessarily  need  to  have  well-defined 
transconductance  if  it  is  tunable.  In  most  integrated  applications,  it  is  more  desirable 
for  linearity  and  crosstalk  to  use  fully  differential  structure.  Also,  it  is  known  that 
fully  differential  circuits  have  better  noise  and  distortion  characteristics  [RazOO], 

Emitter  (or  source)  degeneration  techniques  have  been  commonly  used  to 
improve  circuit  linearity,  and  can  be  applied  to  transconductor  designs.  Fig.  4.8 
shows  a part  of  a transconductor  with  emitter  degeneration.  Given  identical 
transistors  Q1  and  Q2,  we  may  write 


V in+ 


R 

vwv 


V in- 


i 


Figure  4.8  Part  of  emitter  degenerated  transconductor. 
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Vin+  = vbe,Ql  +Rx'- Vbe,Q2  + Vin- 


(4.1) 


By  rearranging  terms, 


vin  = vin+-vin  = R x 1 + (vbe,Ql  ~ vbe,Q2) 


(4.2) 


Since  the  base-emitter  voltage  varies  very  little  when  the  transistors  are  in  the  active 
region,  the  transconductance  is  expressed  as 


C = _L~I 
m v _ R 

in 


(4.3) 


Hence,  the  output  current  can  be  given  by 


A(output  current)  = G x A(input  voltage)=  x A(input  voltage)  (4.4) 

R 


That  is,  the  output  current  is  an  approximately  linear  function  of  input  voltages  at  the 
cost  of  lower  gain  and  higher  noise. 

By  replacing  the  degeneration  resistor  R in  Fig.  4.8  by  the  transistors 
operated  in  the  triode  region,  the  transconductance  gm  is  tunable.  Fig.  4.9  shows  two 
common  approaches  applied  to  bipolar  input  transistors  to  create  tunable 
transconductance.  In  Fig.  4.9  (a),  assuming  identical  transistors  of  Q3  and  Q4,  long- 
channel  MOSFET  triode  operation,  and  identical  NFET  and  PFET  characteristics,  the 
degeneration  resistance  value  R can  be  expressed  as 


R = (rds3 


1 

2x{uC0X(W/L)3>4x(Vgs-Vth)} 


(4.5) 


where  rds  = rds3  = rds4.  Hence,  the  transconductance  value  Gm  is 
Gm  = 2 x { uCox(  W/L)3  4 x ( Vgs  - Vth) } 


(4.6) 


With  bipolar  input  transistors  as  in  Fig.  4.9  (a),  the  gate-source  voltage  Vgs  of 
transistor  Q3  and  Q4  is  equivalent  to  the  base-emitter  voltage  Vbe  of  transistor  Q1  or 
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Figure  4.9  Simplified  schematics  of  tunable  transconductance;  (a)  varying  bias 
approach,  and  (b)  fixed  bias  approach. 


Q2.  Since  the  base-emitter  junction  voltage  of  the  bipolar  transistors  is  almost 
constant,  it  is  not  easy  to  tune  Gm  with  this  bias  scheme.  In  Fig.  4.9  (b),  an  external 
pin  is  used  to  adjust  the  gate-source  voltage  of  degeneration  transistor  Q3,  which  is 
a fixed  bias  approach  [Joh97],  With  this  scheme,  the  transconductance  is  given  by 

Gm  = uC0X(W/L)x(Vgs-Vth)  (4.7) 

Here,  the  gate-source  voltage  of  transistor  Q3  is  determined  by  the  external  input 
voltage  Vcont  and  the  emitter  voltage  of  the  input  bipolar  transistors.  Therefore,  the 
transconductance  Gm  is  adjustable  and  controllable. 

Another  way  of  obtaining  a tunable  transconductor  cell  is  to  add  an  extra 
current  gain  cell  [Joh97],  as  shown  in  Fig.  4.10.  From  the  extra  gain  cell,  tunable  Gm 
value  is  expressed  as 


G 


m 


i h 

— x — 

R I, 


(4.8) 


In  order  to  avoid  excessive  design  complexity,  the  Gm-tunable  transconductor  has 
been  designed  using  a fixed  bias  scheme  shown  in  Fig.  4.9  (b). 
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Figure  4.10  Simplified  schematics  of  tunable  transconductance  using  an  extra 
current  gain  cell  approach. 

Fig.  4.1 1 shows  the  designed  fully-differential  transconductor  cell  with  its 
bias  circuit.  To  determine  the  output  common-mode  voltage  and  to  control  it  to  be 
some  specified  voltage,  a common-mode  feedback  circuit  (CMFB)  is  added  to  each 
transconductor  cell.  Basically,  the  cell  has  a cascode  structure  to  increase  the  output 
impedance,  and  has  an  emitter  degeneration  NMOS  transistor  using  fixed  bias 
scheme. 

3.3. 1.2  Design  Of  A Low  Pass  Biquad  GmC  Filter 

Basic  transconductors  can  be  used  for  building  other  GmC  circuit  blocks, 
such  as  filters,  gyrators,  and  resistor-free  resistor  equivalent  conductances.  By 
cascading  the  first  order  filters  or  the  biquad  filters,  higher  order  filters  may  be 
implemented.  A single-ended  low-pass  biquad  GmC  filter  is  shown  in  Fig.  4.12, 
whose  voltage  gain  can  be  expressed  as, 
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Figure  4. 1 1 Schematics  of  a transconductor  cell  and  its  bias  circuit;  (a)  bias  circuit 
and  (b)  a transconductor  cell  with  common-mode  feedback  (CMFB) 
circuit. 


Figure  4.12  Single-ended  low-pass  biquad  GmC  filter. 
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Low  pass  filter  characteristics,  such  as  cutoff  frequency,  DC  gain,  and  Q 
factor  value,  are  determined  from  the  individual  transconductance  of  each  of  the 
basic  transconductor  cells  constituting  the  GmC  filter.  Hence,  if  individual 
transconductance  is  adjustable  for  each  transconductor,  tunable  filter  characteristics 
may  be  obtained. 

In  order  to  create  a controllable  noise-shaping  filter,  three  different  kinds 
of  transconductors  are  designed  using  different  emitter  degeneration  NMOS 
transistors.  Using  these  primitive  cells,  a differential  4th-order  tunable  low  pass 
GmC  filter  has  been  built,  which  is  shown  in  Fig.  4.13.  Ground  plate  capacitor 
implementation  enables  us  to  avoid  floating  capacitors;  floating  capacitors  causing 
some  unwanted  parasitic  capacitance  between  the  bottom  plates  and  the  ground 
plane.  This  configuration  also  helps  stabilize  the  common-mode  feedback  network  at 
the  cost  of  a larger  area.  Fig.  4.14  shows  one  possible  simulated  result  when 
Vcm  = 1.65V,  Vbias  = 1.1V  , Vgam  = 2.5V  , Vqual  = 2.7V  , and  varying 
Vcutoff.  The  result  shown  in  Fig.  4.14  may  vary  with  other  combination  of  input 


control  signals. 
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Figure  4.13  Cascaded  biquad  structure  using  basic  transconductors. 


vcutoff  (V) 


Figure  4.14  3dB  cutoff  frequency  simulation  result  of  4th  order  GmC  filter  using 
Cadence  SpectreS  when  Vcm=  1.65V,  Vbias=l.lV,  Vgairi=2.5V,  and 
VqUal=2.7V. 
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Another  important  characteristic  of  analog  circuits  is  linearity.  Good 
linearity  characteristics  are  obtained  only  when  processing  undistorted  input  signals, 
and  excessively  large  input  signals  inevitably  cause  some  distortion  on  the  output. 
Linearity  performance  of  the  filter  is  simulated  by  Cadence  SpectreS.  Fig.  4.15(a) 
shows  the  simulation  setup  for  differential  structure  to  be  simulated,  and  Fig.  4.15(b) 
captured  its  DC  simulation  results.  Vcm  is  defined  half  power  supply  voltage,  and 
Vsweep  is  swept  from  OV  to  power  supply  level  to  generate  the  differential  DC  input 
pairs.  It  can  be  observed  that  the  output  signal  pairs  remained  linear  as  long  as  input 
signal  difference  is  less  than  lOOmV.  To  get  much  better  signal  quality,  however, 
differential  input  magnitudes  must  be  smaller  than  this  value  (< lOOmV). 

4,3.2  Digital-to-Analog  Interface  Circuit 

Since  an  active  differential  filter  with  an  input  range  limited  by  distortion 
is  used  for  noise-shaping  in  this  design,  a rail-to-rail  single-ended  digital  signal  can 
not  be  directly  processed  by  the  filters.  A small  allowable  input  voltage  range  for  the 
noise-shaping  analog  filter  must  be  defined  around  the  common-mode  voltage  level. 

Two  different  designs  have  been  implemented,  one  is  voltage  divider  type 
using  NMOS  transistors  shown  in  Fig.  4.16,  and  the  other  is  current-mode  voltage 
level  scaler  shown  in  Fig.  4.17.  In  Fig.  4.16,  NMOS  transistors  are  operating  in  the 
linear  region,  acting  as  resistors.  The  “tone”  input  is  from  the  digital  circuit,  traveling 
from  ground  level  (OV)  to  power  supply  level  (Vdd),  and  the  “Vcm”  is  common-mode 
level  of  the  differential  analog  filter.  Hence,  the  output  “Vinp”  is  defined  as 
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Figure  4.15  Linearity  simulation  of  the  4th  order  GmC  filter  (a)  Linearity 
simulation  setup,  and  (b)  its  simulation  result.  It  shows  linear 
characteristics  with  the  input  magnitude  less  than  lOOmv. 
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Figure  4.16  Voltage  divider  type  voltage  scaler  using  NMOS  transistors.  Voltage 
scaling  ratio  is  determined  by  the  transistor  size  ratio,  such  as  (Ml/ 
M2)  or  (M3/M4). 
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Since  (W/L)M1  is  much  smaller  than  (W/L)M2>  Req,Mi  is  much  larger  than  ReqjM2. 
Transistor  M3  and  M4  are  duplicates  of  Ml  and  M2,  forming  circuit  symmetry  for 
reducing  mismatch  effects. 

As  the  digital  comb  filtering  is  applied  in  front  of  the  analog  noise-shaping 
filter  and  digital  data  is  no  longer  1-bit  wide,  however,  a voltage  divider  type  shown 
in  Fig.  4.16  can  not  be  used.  Furthermore,  as  mentioned  in  the  section  3.3.1,  the 
above  resistor  approximation  of  MOS  transistors  in  the  linear  region  is  not  correct 
for  modern  short-channel  devices. 

To  handle  2-bit  wide  digital  data,  a current-mode  voltage  scaling  circuit 
shown  in  Fig.  4.17  has  been  designed.  It  has  different  digital  interface  input  pins 
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(a) 


(b) 

Figure  4.17  Current-mode  voltage  level  scaler  circuits  for  (a)  a positive  input 
(Vmp)  and  (b)  a negative  input  (Vinn)  of  the  differential  noise-shaping 
filter. 
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(“Tib”  and  “T2b”),  which  are  designed  to  match  the  digital  comb  filter  block  present 
in  the  final  stage  of  the  digital  circuit.  Input  data  “Tib”  and  “T2b”  act  as  a 
thermometer  code,  and  its  coding  process  was  described  in  the  section  3.2. 
Depending  on  the  state  of  input  pins,  transistor  M14  and  M16  of  Fig.  4.17  (a)  are 
turned  on  or  off.  Transistor  Ml 3 and  Ml 5 provide  current  paths  when  the  transistor 
M14  and  Ml 6 are  turned  off,  respectively.  Transistor  Mil,  Ml 2,  Ml 7,  and  Ml 8 
provide  constant  current  paths  to  keep  the  total  current  flowing  through  the  resistor 
R2  maintained.  The  gate  voltage  of  the  transistor  M26  depends  on  the  state  of  the 
input  “Tib”  and  “T2b”.  And  the  regulated  cascode  structure  formed  by  the  transistor 
M24,  M25,  and  M26  helps  increase  the  output  impedance  seen  at  the  drain  of  the 
transistor  M25  and  reduce  unexpected  output  current  fluctuation  to  stabilize  the 
output  nodes  (“Vinp”  and  “Vinn”)  voltage.  While  3-level  2-bit  digital  data  is  scaled 
into  the  positive  input  of  the  differential  analog  filter  using  the  circuit  shown  in  Fig. 
4.17.  (a),  the  negative  input  for  the  common-mode  signal  is  generated  using  the 
circuit  shown  in  Fig.  4.17.  (b). 

4.4  IC  Implementation 

4.4.1  Signal  Generator  Using  Memory-Based  Periodic  Bitstream  Approach  (Rev.l) 
First  prototype  (Rev.l)  was  built  using  0.25um  IBM6HP  SiGe  process. 
The  Rev.l  design  includes  a 4x64 -bit  ROM,  a 64-bit  shifter  register,  2-stage 
differential  inverters  with  a XOR  doubler,  a 10-bit  internal  clock  monitoring  down- 
counter,  and  a 4th  order  GmC  low-pass  filter,  as  seen  in  Fig.  4.18.  It  has  dedicated 
power  and  ground  lines  for  each  digital  and  analog  circuit  blocks,  which  helps  isolate 
noise-sensitive  analog  filter  from  noisy  digital  circuits,  even  though  two  separate 
ground  lines  are  virtually  connected  each  other  through  the  substrate  resistance.  The 
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Figure  4.18  Simplified  block  diagram  of  signal  generator  using  memory-based 
periodic  bitstream  approach  (Rev.l) 

embedded  analog  filter  consists  of  8 transconductor  cells  and  8 ground  plate  metal- 
insulator-metal  (MIM)  capacitors  forming  differential  4th  order  low-pass  filter.  8 
transconductor  cells  are  laid  out  surrounded  by  8 MIM  capacitors  for  secure 
protection  from  digital  circuit  blocks  using  huge  “Deep  Trench”  guard  ring.  (MIM 
capacitors  are  implemented  on  the  “Deep  Trench”  layer.)  And,  the  analog  filter  block 
has  a lot  of  substrate  contacts  around  the  boundary  to  the  digital  circuits  to  reduce 
the  parasitic  resistance  associated  with  the  substrate. 

Digital  circuit  blocks  can  be  operated  with  an  external  clock  source  forced 
via  “dummy  clock”  pin  as  well  as  the  internally  generated  clock,  “osckillb”  input 
selects  one  of  the  two  available  clock  sources.  The  external  clock  source  is  useful 
when  the  internal  clock  generator  is  not  working  properly.  However,  external  clock 
speed  during  package  sample  measurement  can  not  be  comparable  to  the  clock  speed 
generated  by  internal  variable  clock  generator,  due  to  the  limitation  of  external  pin 
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driving  capability.  (With  the  availability  of  probecards  and  direct  access  to  the  IC 
pads  without  bonding  wires,  this  externally  driving  clock  speed  could  be  increased.) 
The  chip  floorplan  of  Rev.l  is  shown  in  Fig.  4.19  with  pads  and  actual  core  size  is 
(900um  x 650um) . 
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Figure  4.19  Floorplan  of  signal  generator  using  memory-based  periodic  bitstream 
approach  (Rev.l) 

Final  full  chip  simulation  has  been  done  by  the  Cadence  SpectreS.  Unlike 
other  pure  digital  circuits,  this  mixed-signal  design  must  be  verified  at  the  transistor 
level.  Transistor  level  simulation  takes  a lot  of  simulation  time  and  schematic  work 
whenever  different  design  ideas  are  attempted.  Especially,  ROM  code  modification 
brings  about  much  schematic  work,  which  would  be  easier  to  do  in  text  level  work. 
Also,  in  the  case  of  a shift  register,  each  cell  has  same  structure  and  the  only  node  of 
interest  is  one  output  node.  Except  one  register  cell  producing  the  necessary  output 
which  is  interfaced  with  an  analog  part,  whole  circuit  does  not  need  to  be  simulated 
at  the  transistor  level.  For  efficient  full  chip  simulation,  a shift  register  and  a ROM 
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are  replaced  by  VerilogA  models,  as  shown  in  Fig.  4.20.  These  partial  behavioral 
models  save  simulation  time  and  memory  usage  required  during  the  simulation. 
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Figure  4.20  Simulation  model  for  mixed-signal  design.  For  efficient  full  chip 
simulation,  a shift  register  and  a ROM  are  replaced  by  VerilogA 
models,  except  one  register  cell  producing  the  necessary  output. 
These  partial  behavioral  models  save  simulation  time  and  memory 
usage  required  during  the  simulation. 

Single-tone  and  dual-tone  simulation  results  of  Rev.l  are  shown  in  Fig. 
4.21  (a)  and  (b),  respectively.  Final  full  chip  simulation  done  by  Cadence  SpectreS 
uses  a transistor  level  netlist  for  the  whole  chip,  and  reports  about  35dB  SFDR  for 
single-tone  case  and  25dB  SFDR  for  dual-tone  case.  It  should  be  mentioned  that  both 
results  suffer  from  harmonic  components,  especially,  even  harmonic  ones. 

4.4.2  Signal  Generator  Using  Memory-Based  Periodic  Bitstream  Approach  (Rev.2) 
Design  revision  (Rev.2)  has  been  done  to  improve  the  signal  quality  in  the 
standpoint  of  SFDR.  From  the  research  plan,  Rev.2  was  supposed  to  be  developed 
after  getting  feedback  from  Rev.l  design  through  careful  design  verification  and 
measurement.  However,  Rev.l  fabrication  schedule  was  unexpectedly  postponed 


100 
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Figure  4.21  Cadence  SpectreS  simulation  results  (Rev.l);  (a)  single-tone  signal 
generation  and  (b)  dual-tone  signal  generation.  The  simulation  results 
show  about  35dB  SFDR  for  single-tone  case  and  25dB  SFDR  for  dual- 
tone case. 

until  the  Rev. 2 design  has  been  finished.  Since  Rev. 2 could  not  reflect  on  the  weak 
points  of  Rev.l  by  using  real  measurement  results,  design  revision  has  been  done 
using  only  simulation  results.  Rev. 2 makes  use  of  a single-ended  5-stage  ring 
oscillator  with  voltage  controlled  shunt  capacitors  to  vary  the  load  capacitance  on 
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each  inverter  output  nodes,  and  digital  comb  filtering  on  the  1-bit  data  bitstream 
repeatedly  generated  by  shifter  register,  as  discussed  in  Chapter  3.  Furthermore,  a 
voltage  scaling  circuit  to  interface  the  analog  noise-shaping  filter  has  been  replaced 
by  current-mode  operating  circuit,  because  the  voltage  divider  circuit  using  linear 
region  transistors  employed  in  Rev.l  shows  unpredictable  characteristics  with  short- 
channel  transistors.  And,  the  output  buffer  has  been  attached  to  the  analog  filter 
output  to  boost  the  output  signal  level,  which  could  be  easily  measured  externally. 
However,  an  output  buffer  inserted  at  the  final  stage  right  before  the  signal  output 
pads  would  deteriorate  the  signal  quality,  due  to  its  limited  linearity  characteristics. 
Top  level  schematic  and  full  chip  floorplan  of  Rev. 2 are  shown  in  Fig.  4.22  and  Fig. 
4.23,  respectively. 
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Figure  4.22  Top  level  schematic  of  signal  generator  using  memory-based  periodic 
bitstream  approach  and  digital  comb  filtering  (Rev. 2) 
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Figure  4.23  Floorplan  of  signal  generator  using  memory-based  periodic  bitstream 
approach  and  digital  comb  filtering  (Rev. 2) 

Full  chip  simulation  using  Cadence  SpectreS  has  been  done  to  estimate  the 
SFDR  and  the  frequency  of  generated  signal  sources.  For  single-tone  generation 
case,  as  seen  in  Fig.  4.24,  it  shows  about  45dB  SFDR  (=  (-  40dB  - (-85dB))) . 
Despite  the  reduced  even  harmonics  via  the  embedded  digital  comb  filter,  there  is 
still  a dominant  second  harmonic  component  caused  by  the  analog  noise-shaping 
filter.  It  should  be  mentioned  that  as  the  cutoff  frequency  of  the  noise-shaping  filter 
varies,  the  SFDR  could  be  also  varied. 

Dual-tone  signal  generation  becomes  more  complicated,  because  many 
dual-tone  test  environments  require  those  tones  to  be  in  the  baseband  with  identical 
magnitudes.  For  the  example  shown  in  Fig.  4.25,  the  generated  dual-tone  sources  are 
located  at  25MFIz  and  50MHz.  Unwanted  harmonic  components,  which  are  located 
at  75MHz,  100MHz,  125MHz,  and  so  on,  definitely  should  be  removed  or  suppressed 
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to  improve  the  SFDR  of  dual-tone  source.  To  reduce  the  harmonic  component  at 
75MHz  efficiently,  we  have  to  move  the  3dB  roll-off  frequency  of  filter  close  to  the 
50MHz  component.  Different  gains  of  filter  frequency  response  at  25MHz  and 
50MHz  make  the  magnitudes  of  dual  tones  of  generated  source  different.  Therefore, 
50MHz  component  should  be  larger  than  25MHz  component  during  the  modulation 
step,  considering  its  smaller  gain  after  the  noise-shaping  process.  The  cutoff 
frequency  of  noise-shaping  filter  applied  to  the  example  of  Fig.  4.25  can  be 
decreased  to  match  the  magnitudes  of  two  tones  at  25MHz  and  50MHz.  Cadence 
SpectreS  simulation  results  show  less  than  40dB  SFDR  for  dual-tone  generation  case. 
The  closer  3dB  roll-off  frequency  is  to  the  dual  fundamental  components  for  noise- 
shaping, the  better  the  signal  quality  is  with  unmatched  filter  gain  at  the  dual-tone 
frequencies. 


Figure  4.24  Cadence  spectreS  simulation  result  of  single-tone  generation  (Rev. 2). 
The  simulated  SFDR  is  about  45dB. 


104 


Time  Domain  Response  DJ  DFT  Result 


Figure  4.25  Cadence  spectreS  simulation  result  of  dual-tone  generation  (Rev. 2). 

The  simulated  SFDR  is  about  40dB. 

4.5  IC  Measurement 
4.5.1  Measurement  Environment 

As  a primary  motivation  of  this  work,  easy  measurement  setup  and  simple 
environment  have  been  pursued  throughout  the  whole  design.  Unlike  the  previous 
research  on  signal  generation  done  by  Dufort  and  Roberts  [DufOOb],  this  work  does 
not  require  preliminary  code  loading  before  normal  operation.  Except  for  a few 
digital  control  input  signals,  every  input  is  simply  a time-invariant  DC  value,  which 
is  easy  to  handle  for  measurement.  The  small  output  signal  level  of  Rev.  1 needed  to 
be  boosted  for  measurement,  however,  necessitating  another  amplifier  stage,  as 
illustrated  in  Fig.  4.26.  The  PCI  board  provided  by  National  Instruments  (NI) 
Lab  View  delivered  some  digital  inputs  to  initialize  the  shift  register  and  clock 
generator  [NatOO],  Also,  a battery  is  used  to  supply  clean  power  to  the  separate 


amplifier. 
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Figure  4.26  Illustration  of  measurement  setup  for  the  signal  generator  (Rev.l  and 
Rev. 2)  using  the  proposed  memory-based  periodic  bitstream 
approach. 

If  an  additional  power-on-reset  circuit  is  integrated  into  this  design,  the 
test  equipment  can  be  more  simplified  by  removing  the  NI  LabView  PCI  board.  Fig. 
4.27  shows  a simple  circuit  for  power-on-reset  generation,  and  its  generated  “Reset” 
signal  can  be  used  to  reset  internal  registers  and  counters. 
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Figure  4.27  An  example  of  a power-on-reset  circuit  and  its  timing  diagram.  Reset 
time  duration  can  be  controllable  with  the  capacitor  value  of  C. 
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4.5.2  Measurement  Results  And  Analysis 

Fabricated  samples  for  Rev.l  and  Rev. 2 have  been  provided  by  MOSIS  as 
diced  bare  wafer,  40  per  each  designs.  Each  wafer  samples  are  bonded  using  a ball 
bonder,  and  measured  for  DC  characteristics  using  HP4155.  Fig.  4.28  shows  some 
captured  measurement  data  on  8 different  pads,  which  are  a digital  power  supply  pad 
(Fig.  4.28  (a)),  an  analog  power  supply  pad  (Fig.  4.28  (b)),  an  analog  ground  pad 
(Fig.  4.28  (c)),  a digital  input  (“tonesell”)  pad  (Fig.  4.28  (d)),  an  analog  input 
(“vcm”)  pad  (Fig.  4.28  (e)),  a digital  input  (“toneselO”)  pad  (Fig.  4.28  (f)),  a clock 
generator  control  input  (“vent”)  pad  (Fig.  4.28  (g)),  and  another  digital  input 
(“osckill  b”)  pad  (Fig.  4.28  (h)).  In  the  case  of  Fig.  4.28  (a),  the  digital  power  supply 
pin  is  forced  by  a control  voltage  source  with  0V  biased  digital  ground  pin.  However, 
each  of  the  CMOS  transistors  between  the  power  and  ground  pins  have  floating  gate 
inputs,  which  causes  some  static  current  as  shown  in  Fig.  4.29,  as  the  control  voltage 
increases.  The  case  of  Fig.  4.28  (b)  is  the  same  situation  as  in  the  Fig.  4.28  (a),  but 
with  a smaller  current  path  from  the  supply  node  to  the  ground. 

Since  the  analog  ground  pin  is  grounded  for  the  whole  period  of 

measurement,  there  is  some  current  paths  through  the  substrate  contacts.  The 

measurement  data  shown  in  Fig.  4.28  (c)  shows  that  the  substrate  resistance  is  about 

30ohm  \ = — 2V  - 0V — j pjowever  this  value  is  not  absolute,  because  the  number 
V 65mA  -0mA2 

of  substrate  contacts  and  the  physical  distance  between  digital  ground  and  substrate, 
and  analog  ground  and  substrate  can  vary  the  substrate  resistance.  Fig.  4.28  (d)  and 
(e)  show  the  measurement  results  of  CMOS  and  NMOS  transistor  input  nodes, 
respectively.  The  gate  input  of  any  CMOS  transistors  is  normally  floating  and  high 
impedance.  Therefore,  working  transistors  have  no  DC  current  path  from  the  gate 


107 


CURB  ON  l NN7.W0IHJBMIBB4U  1W.  534Bl8B«O0nB  > 

HAItKKnt  Sea.CaO  -I.TtltuA  > 

WIN  9ID  Bi>v 

ia>  r oo  r 4.7eeNu<) 


(« > 


Figure  4.28  DC  measurement  results  using  HP4155  showing  (a)  a digital  Vdd  pad, 
(b)  an  analog  Vdd  pad,  (c)  an  analog  ground  pad  (d)  a digital  input 
(“tonesell”)  pad,  (e)  an  analog  input  (“vcm”)  pad,  (f)  a blown-out 
input  (“toneselO”),  (g)  a blown-out  input  (“vent”),  and  (h)  a blown- 
out  input  (“osckill_b”).  In  (f),  (g),  and  (h),  as  the  applied  gate  voltage 
is  increased,  the  current  flowing  to  the  gate  is  also  almost  linearly 
increased. 
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Figure  4.29  An  example  showing  the  formation  of  static  current  path  with  gate 
nodes  floating.  The  undetermined  gate  voltage  of  each  transistor  may 
make  both  P-type  and  N-type  transistors  turned  on  at  the  same  time, 
allowing  the  static  current  from  the  power  supply  node  to  the  ground 
node. 

node  to  the  source  or  drain,  aside  from  the  leakage  current.  Fig.  4.28  (d)  and  (e)  show 
only  small  random  current  caused  by  noise,  not  by  leakage. 

While  measuring  the  V-I  characteristic  of  fabricated  wafer  samples,  a 
serious  problem  was  discovered  in  these  digital  input  nodes  (a  ROM  code  selection 
control  signal  “toneselO”,  an  variable  clock  control  signal  “vent”,  and  a system  clock 
source  selection  signal  “osckill  b”  in  Fig.  4.28).  The  gate  node  acts  like  a resistor, 
and  the  output  current  is  almost  monotonically  increasing  as  the  forcing  control 
voltage  increases,  as  shown  in  Fig.  4.28  (f),  (g),  and  (h),  which  means  that  the  gate 
node  is  no  longer  floating  and  can  not  control  the  transistor.  This  catastrophic 
malfunction  can  be  explained  by  excessive  Electrostatic  Discharge  (ESD)  applied  to 
the  inputs  which  frequently  occurs  in  dry  cold  weather.  As  seen  in  Fig.  4.30  (a),  if 
the  gate  oxide  capacitance  of  small-sized  short  channel  transistors  without  any 
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(a)  (b) 

Figure  4.30  (a)  Illustration  of  gate  oxide  blown  out  due  to  ESD  (Electrostatic 

Discharge),  and  (b)  an  example  of  ESD  protection  structure 

protection  structure  is  exposed  to  excessive  charge  from  ESD,  the  gate  oxide  can 

rupture  and  short  to  the  channel  under  the  gate.  Due  to  the  ESD,  the  oxide  layer 

designed  to  act  as  an  insulator  is  turned  into  a conductor,  resulting  in  resistive 

behavior.  This  explains  the  phenomenon  shown  in  Fig.  4.28  (f),  (g),  and  (h). 

In  order  to  avoid  this  ESD  problem,  we  must  provide  extra  current  paths 

for  exceptionally  large  positive  and  negative  charges  to  flow  through  the  power  and 

ground  nodes.  Given  a couple  of  diodes  parallel-connected  to  the  gate  input  node,  as 

shown  in  Fig.  4.30  (b),  2 additional  paths  are  now  available,  one  using  the  diode  D1 

is  for  handling  large  positive  charge,  and  the  other  path  through  the  diode  D2  is  for 

sinking  large  negative  charge.  And,  enlarging  the  buffer  is  also  helpful  to  protect  the 

small  gate  vulnerable  to  any  abnormal  conditions,  since  a bigger  gate  input  buffer  is 

much  more  robust  with  its  larger  capacitance.  Besides  the  methods  discussed  in  Fig. 

4.30  (b),  there  are  many  other  ESD  protection  techniques  developed  by  IC 

manufacturers,  because  commercial  ICs  have  more  strict  design  specifications  to 


protect  their  products. 
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Many  packaged  samples  (Rev. 2)  are  not  working  due  to  these  damaged 
transistors,  especially  digital  control  transistors.  The  Rev. 2 design  was  revised  to 
strengthen  the  immunity  to  the  unexpected  ESD  problem  by  adding  huge  diode- 
connected  MOS  transistors  and  large  input  buffers  to  every  digital  input  node.  In 
addition,  small  resistors  are  inserted  between  the  digital  pad  and  the  internal  gate 
oxide  node,  helping  decreasing  the  amount  of  current  flowing  into  the  signal. 

With  more  careful  packaging  work  and  handling  as  well  as  design  revision 
(Rev.2.1),  we  can  achieve  reasonable  measurement  results  from  Rev.2.1.  Fig.  4.31 
shows  a measured  waveform  of  the  ring  oscillator  variable  clock  generator  using  the 
test  environment  shown  in  Fig.  4.26.  As  the  voltage  of  the  control  gate  connected  to 
shunt  capacitors  varies,  the  output  frequency  of  monitoring  down-counter  also 
varies.  Comparison  of  the  simulated  result  and  measurement  data  is  done  in  Fig.  4.32. 
Three  different  samples  were  measured  to  get  the  relation  between  control  signal 
voltage  and  the  output  clock  frequency,  and  internal  clock  speed  is  calculated  by 
multiplying  the  measured  frequency  by  1024,  since  the  output  clock  is  outcoming 
through  the  10-bit  down-counter.  All  those  3 samples  display  consistent  results, 
while  showing  a large  difference  from  the  simulated  results.  This  discrepancy  comes 
from  the  routing  metal  capacitance  which  was  not  included  in  simulation. 

Another  measurement  difficulty  is  caused  by  digital  signal  switching 
noise,  which  is  mainly  associated  with  the  digital  circuit  blocks.  These  blocks  are 
triggered  by  the  system  clock.  The  CMOS  digital  logic  has  no  static  current 
dissipation  during  normal  operation,  but  a large  short  circuit  current  is  unavoidable 
at  the  instant  of  the  logic  state  transition.  The  clock  monitoring  output  has  to  drive 
large  capacitance  of  external  measurement  equipment,  which  induces  a large  internal 
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Figure  4.31 


(b) 


(c) 

The  measurement  results  of  clock  monitoring  down-counter  output 
(Rev. 2.1);  (a)  measured  output  frequency  is  1.08MHz  when  Vcnt  = 
1.0V,  (b)  measured  output  frequency  is  922KHz  when  Vcnt  = 2.0V, 
and  (c)  measured  output  frequency  is  493KHz  when  Vcnt  =3.3V.  The 
real  clock  frequency  generated  by  the  on-chip  variable  clock 
generator  is  1024  times  faster  than  this  measured  value,  because  10- 
bit  down-counter  was  used.  It  shows  that  the  clock  speed  gets  slower 
with  the  increasing  control  voltage  Vcnt. 
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Figure  4.32  Variable  clock  comparison  between  package  sample  measurement 
and  simulation  results  (Rev. 2.1).  Due  to  the  routing  metal 
capacitance,  which  is  not  considered  for  simulation,  measured  clock 
speed  is  much  slower  than  the  simulated  results.  3 different  package 
samples  show  very  similar  trend  of  behavior  on  different  control 
voltages. 

current  sinking/sourcing  via  the  ground  node.  Every  state  transition  of  a huge  output 
buffer  may  cause  serious  glitch  problems  in  the  neighboring  circuits,  resulting  in 
fluctuation  on  the  ground  node.  This  unstable  ground  node  directly  affects  analog 
filter  operation,  deteriorating  the  generated  signal  quality.  Therefore,  good  power 
supply  rejection  ratio  (PSRR)  of  an  analog  part  is  required  for  mixed-signal  design. 
Fig.  4.33  shows  measured  a down-counted  clock  signal  (top)  and  a generated  signal 
source  output  (bottom)  which  is  disrupted  by  the  huge  signal  interference  from  the 
digital  circuit  blocks. 

One  more  design  variable  to  be  considered  during  design  and 
measurement  stages  is  the  parasitic  effects  originated  from  probe  or  bondwire 
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Figure  4.33  Captured  measurement  results  (Rev. 2.1)  of  the  clock  monitoring 
output  (top)  and  the  generated  signal  source  output  (bottom).  The 
state  transition  of  the  clock  monitoring  signal  makes  huge  glitch  on 
the  generated  signal  source  output. 

connections.  Even  though  they  do  not  have  effects  as  serious  as  in  RF  circuit  design, 
they  are  still  working  against  the  proper  operation  of  sensitive  analog  circuit  parts, 
especially  through  power/ground  lines  and  DC  control  input  lines.  Also,  any 
undesired  loading  effects  from  a measuring  oscilloscope  should  be  reflected  in  IC 
design  and  measurement  analysis.  Fig.  4.34  shows  the  reconstructed  simulation 
model  of  a full  chip  signal  generator,  including  a probe  model  and  an  oscilloscope 
model.  The  parasitic  inductance  L and  the  resistance  R of  the  probe  model  parameters 
are  changeable  depending  on  the  physical  condition  of  the  probe  used  (for  example, 
material  and  length)  and  the  frequency  of  the  signal  of  interest.  The  oscilloscope 
model  is  provided  by  the  manufacturer,  as  shown  in  Fig.  4.34. 

To  investigate  the  effects  of  these  additional  circuit  elements  in  signal 
generation,  the  full  chip  schematic  of  Rev. 2.1  design  was  resimulated  using  Fig. 
4.34.  Since  the  experimental  data  for  the  probe  model  is  not  provided,  the  0.1  nH/ 


114 


Figure  4.34  Signal  generator  simulation  model  including  the  parasitic  probe  and 
oscilloscope  models.  While  the  oscilloscope  model  is  provided  by  the 
manufacturer  as  shown  above,  the  probe  model  (a  series  model  of 
inductance  L and  resistance  R)  is  not  known.  If  the  package  samples 
are  used  for  measurement,  the  probe  model  should  be  replaced  by  the 
bonding  wire  and  package  leadframe  model. 

0.2Q  and  0.5nFl/l  Q cases  were  simulated  using  Cadence  SpectreS,  as  shown  in  Fig. 
4.35.  The  simulation  results  shown  in  Fig.  4.35  are  apparently  worse  than  the 
simulation  result  shown  in  Fig.  4.24,  which  is  simulated  without  any  external 
elements. 


Signal  fluctuation  shown  in  Fig.  4.35  due  to  the  parasitic  inductance  and 
resistance  can  be  reduced  by  using  bypass  capacitors  on  power  supply  nodes  and  DC 
control  inputs.  Fig.  4.36  shows  the  simulation  results  with  on-chip  bypass  capacitors 
on  the  digital  and  analog  power  supply  lines.  In  Fig.  4.36  (a),  a probe  model  of  0.5nH/ 
IQ  and  2 bypass  capacitors  of  lOnF  are  used  for  full  chip  simulation,  which  shows 
that  high-frequency  noise  fluctuation  is  drastically  reduced.  Flowever,  lOnF 
capacitors  are  too  huge  to  integrate  on  the  chip,  even  though  high  density  MOSCAP- 
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Figure  4.35  Cadence  SpectreS  simulation  results  of  the  signal  generator 
(Rev. 2. 1);  (a)  with  a probe  model  of  0.  lnH/0.2Q , and  (b)  with  a probe 
model  of  0.5nH/lQ.  Both  (a)  and  (b)  also  include  the  oscilloscope 
model  which  is  shown  in  Fig.  4.34. 
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Figure  4.36  Cadence  SpectreS  simulation  results  of  the  signal  generator 
(Rev. 2.1);  (a)  with  a probe  model  of  0.5nH/lQ  and  lOnF  bypass 
capacitors,  and  (b)  with  a probe  model  of  0.5nH/l  Q and  20pF  bypass 
capacitors  on  the  digital  and  analog  power  supply  nodes. 
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based  capacitors  are  available.  Using  more  practical  size  for  capacitors  (20pF),  the 
whole  circuit  was  resimulated,  showing  the  results  in  Fig.  4.36  (b).  From  the  results 
in  Fig.  4.36,  the  on-chip  bypass  capacitors  are  strongly  recommended  for  suppressing 
the  high-frequency  noise  on  the  power  lines  and  the  generated  signal  output. 

Another  possible  problem  may  occur  with  the  usage  of  the  digital 
oscilloscope.  The  digital  oscilloscope  is  commonly  used  to  analyze  in  the  frequency- 
domain.  Though  the  conventional  digital  oscilloscope  provides  an  fast  Fourier 
Transform  (FFT)  function  which  is  very  useful  to  transform  the  time-domain  signal 
to  the  frequency-domain,  it  causes  significant  loading  effects.  This  inserts  a 
dominant  pole  in  the  low  frequency  range,  due  to  its  a huge  capacitace  and  resistance 
(shown  in  Fig.  4.34).  This  unwanted  dominant  pole  may  unduly  shrink  the 
fundamental  signal  tones. 

4.6  Summary 

In  this  chapter,  the  circuit  design  and  implementation  of  the  signal 
generator  using  a memory-based  periodic  bitstream  was  described.  The  Rev. 2 design 
has  been  fabricated  using  0.25um  IBM6HP  SiGe  process  and  tested  for  DC 
characteristics  using  HP4155.  The  DC  measurement  results  show  the  serious  ESD 
problems  on  many  MOS  I/O  signal  pads,  which  makes  the  digital  control  circuit 
blocks  uninitialized  with  external  inputs.  The  Rev. 2.1  design  has  been  revised  from 
the  Rev. 2 for  the  reliable  ESD  immunity  and  slight  change  of  the  DA  interface 
circuit.  After  packaging  work  and  the  subsequent  measurement  with  careful 
handling,  the  digital  circuit  blocks  are  turned  out  to  be  working  as  expected  from  the 
Cadence  SpectreS,  except  for  the  discrepancy  in  the  generated  internal  clock  speed. 
(This  is  because  the  designed  circuit  verification  through  the  transistor-level 


118 


simulation  does  not  consider  the  routing  metal  parasitics.)  However,  expected  signal 
source  output  could  not  be  properly  measured  due  to  the  following  reasons. 

(1)  The  signal  fluctuation  caused  by  the  parasitic  inductance  and 
resistance  of  a bonding  wire  or  probe  model  ruined  the  proper  operation  of  post- 
processing by  current-mode  DA  interface  circuit  and  the  active  filter. 

(2)  The  digital  circuit  switching  noise  promoted  by  large  output  buffers  in 
Rev. 2.1  design  seriously  degraded  the  sensitive  analog  circuit  blocks. 

(3)  The  diminished  signal  output  level  due  to  the  huge  loading  effect  from 
monitoring  measurement  equipment  is  another  reason  the  generated  source  could  not 
be  confirmed.  The  reduced  signal  is  easily  mixed  up  with  the  interfering  noisy  signal 
present  in  the  test  board. 


CHAPTER  5 

EMBEDDED  SOURCE  GENERATION  USING  LFSR  APPROACH  WITH  FIR 

FILTERING 

5.1  Introduction 

The  memory-based  periodic  bitstream  approach  proposed  in  Chapter  3 and 
Chapter  4 uses  only  two  frequency  bins,  as  shown  in  Fig.  3.4.  The  small  number  of 
frequency  bins  available  in  embedded  signal  generation  saves  a lot  of  hardware,  and 
shortens  the  length  of  the  bitstream  to  be  programmed  on-chip.  With  this  approach, 
however,  the  unwanted  dominant  harmonic  components  are  located  close  to  the 
desired  fundamental  component(s).  And,  it  is  not  easy  to  filter  out  these  harmonic 
components.  To  guarantee  a high  spurious-free  dynamic  range  (SFDR),  a low  pass 
filter  having  very  sharp  bandwidth  characteristics  must  be  built.  Furthermore,  the 
cutoff  frequency  should  be  tunable.  This  is  difficult  considering  the  limited  silicon 
area  and  required  high  linearity  of  the  filter. 

In  this  chapter,  the  embedded  signal  generation  method  using  linear 
feedback  shift  register  (LFSR)  chains  and  finite  impulse  response  (FIR)  filtering 
technique  is  described.  This  method  reduces  the  required  hardware  resources  by 
simplifying  the  digital  circuit  blocks,  and  also  eases  the  filtering  design  specification 
by  eliminating  the  intermediate  harmonic  components.  This  chapter  includes  the 
basic  idea  of  using  LFSR  chains  and  FIR  filtering,  system  performance  estimation, 
overall  hardware  structure,  IC  implementation,  its  measurement  and  analysis. 
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5.2  Basic  Concept 

Analog  FIR  filtering  before  the  noise  shaping  stage  is  another  approach  to 
reduce  the  dominant  harmonic  components  near  the  fundamental  [Lan77,  Sch78],  By 
removing  undesired  harmonics  coming  after  the  fundamental,  the  low  pass  filtering 
specification  for  noise  shaping  will  be  relaxed.  A sharp  transition  curve  in  the  filter 
characteristic  is  not  required,  and  the  tuning  range  of  the  filter  cutoff  frequency  can 
be  narrowed.  Also,  to  reduce  hardware,  the  on-chip  storage  ROM  and  lengthy  shifter 
chain  registers  in  Chapter  3 can  be  replaced  by  an  LFSR.  Fig.  5.1  shows  a DC  offset 
square-wave  in  the  time-domain  and  the  frequency-domain.  Because  the  digital  pulse 
train  is  readily  generated  using  a shift  register  or  ring  oscillator,  the  key  work  is  in 
removing  the  undesired  harmonics,  as  shown  in  Fig.  5.1  (b). 

x(t)  X(f) 


time  DC  fb  3fb  5fb  7fb  freq. 

(a)  (b) 

Figure  5.1  Square-wave  pulse  (a)  in  the  time-domain,  and  (b)  in  the  frequency- 
domain 

Considering  the  characteristic  of  an  ideal  square-wave  pulse,  as  shown  in 
Fig.  5.1  (b),  one  way  of  removing  the  undesired  components  while  leaving  the 
desired  fundamental  is  to  place  the  zeros  on  the  undesirable  harmonic  locations.  For 
ease  of  design  we  overlap  the  zeros  and  poles  at  the  fundamental  frequency  and  the 


desired 
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even  harmonic  locations,  as  shown  in  Fig.  5.2  (a).  The  transfer  function  of  this  FIR 
filter  can  be  expressed. 

H(Z)  = 1-Z~n2Kf (5.1) 

( 1 - z-n/2)  x (,  -2cos(  ‘ j 7~  + Z“2J 

ls 

and,  its  time-domain  representation  when  n=16  is 

y(k)  = x(k)  + 1.8478x(k  - 1)  + 2.4142x(k-  2)  + 2.613 lx(k  - 3)  (5.2) 

2.4142  x(k  - 4)  + 1.8478  x(k  - 5)  + x(k  - 6) 

Square  pulses  generated  using  real  circuits  do  not  have  the  ideal  properties 

shown  in  Fig.  5.1  (b).  However,  the  pulses  have  some  even  harmonic  components, 

which  could  degrade  the  signal  quality.  To  avoid  this  degradation  in  signal  quality, 

as  shown  in  Fig.  5.2  (b),  the  zeros  of  FIR  filtering  are  required  at  the  even  harmonic 

locations,  as  well  as  the  odd  harmonics.  This  transfer  function  can  be  expressed 

H(z)  = (5.3) 

I - 2cos(-^-l)z-1  + z-2 

rs 

The  problem  with  this  filtering  is  that  there  are  negative  filter  coefficient  values,  and 
it  is  not  straightforward  to  implement. 

To  remove  the  negative  coefficients  of  filtering  in  Fig.  5.2  (b),  another 
pole  at  the  DC  point  is  added,  as  shown  in  Fig.  5.2  (c),  and  its  transfer  function  is 

H(z)  = 1-Z~n?  T (5-4) 

(1  -z_1)  x fl  -2cos^— p-^)z-1  + z_2j 

This  FIR  filtering  does  not  have  negative  coefficients,  as  shown  in  Table  4.1. 

Another  possible  way  to  remove  the  unwanted  harmonic  components 
associated  with  square-wave  pulses  is  using  cascaded  integrator-comb  (CIC)  filter 
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(a) 


(b) 


(c) 


(d) 

Figure  5.2  Harmonic  components  cancellation  methods  using  FIR  filtering  (a) 
with  zeros  at  odd  harmonics  (FIRO),  (b)  with  zeros  at  DC  and  all  the 
harmonics  except  the  fundamental  component,  (c)  with  zeros  at  all  the 
harmonics  except  the  fundamental  component  (FIR1),  and  (d)  with 
zeros  at  all  the  harmonics  including  the  fundamental  component. 
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[Tay98].  This  implementation  is  very  simple,  because  it  needs  only  the  addition  of 
sample  values  without  any  scaling,  as  in 

y(k)  = x(k)  + x(k  - 1)  + x(k  - 2)  + x(k- 3)  + ...  (5.5) 

Its  transfer  function  comes  from  a integrator  and  a comb  filter,  as  follows. 

H(z)  = (5.6) 

1 - z-1 

However,  this  simple  implementation  of  CIC  filtering  is  not  helpful  for 
embedded  source  generation,  because  of  the  unacceptably  low  gain  at  the 
fundamental  frequency,  as  shown  in  Fig.  5.2  (d). 


Table  4.1  Filter  coefficients  of  Fig.  5.2 


FIR  filtering  coefficient  when  n = 1 6 

FIR  in  Fig.4.2  (a) 

1.0000  1.8478  2.4142  2.6131  2.4142  1.8478  1.0000 

FIR  in  Fig.4.2  (b) 

1.0000  1.8478  2.4142  2.6131  2.4142  1.8478  1.0000  -0.0000 

-1.0000  -1.8478  -2.4142  -2.6131  -2.4142  -1.8478  -1.0000 

FIR  in  Fig.4.2  (c) 

1.0000  2.8478  5.2620  7.8751  10.2893  12.1371  13.1371  13.1371 

12.1371  10.2893  7.8751  5.2620  2.8478  1.0000 

FIR  in  Fig.4.2  (d) 

1111111111111111 

5.3  System  Performance  Consideration 
Assuming  a ideal  square-wave  and  perfect  FIR  filtering  followed  by  ideal 
low  pass  filter,  generated  signal  quality  may  be  estimated.  An  8-stage  rotator  chain 
register  triggered  by  system  clock  frequency  fclk  and  configured  with  a negative 
feedback  can  generate  a square-wave,  and  its  fundamental  frequency  is  (fc[k/16).  The 
subsequent  FIR  filter  designed  as  shown  in  Fig.  5.3  eliminates  the  intermediate 
harmonic  components,  leaving  the  fundamental  and  its  mirroring  copies.  In  this 
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example,  the  mirroring  copies  are  the  15th  harmonic  (15fb),  the  17th  harmonic 
(17fb),  the  31st  harmonic  (31fb),  the  33rd  harmonic  (33fb),  and  so  on.  Given  a 
second-order  low-pass  filter  having  a cutoff  frequency  of  1.5fb  for  post-processing 
of  FIR  filter  output,  the  magnitude  of  the  largest  harmonic  component  located  at  15fb 
can  be  calculated  as 

f 15fb\ 

signal  magnitude  = - 23.5dB  - 401ogf  — — j dB  = -63.5dB  (5.7) 

Since  the  fundamental  component  magnitude  is  OdB,  the  signal  quality 
represented  as  SFDR  is  more  than  60dB  in  this  ideal  case.  To  get  a result  close  to  this 
ideal  value,  however,  several  practical  issues  should  be  considered,  such  as  the 
square-wave  behavior,  FIR  filter  implementation,  and  post-low-pass  filtering  issues. 


Figure  5.3  Ideal  system  performance  estimation  using  2nd-order  post-low-pass 
filter. 
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5.3.1  Square- Wave  Behavior 

Ideal  square-waves  must  have  zero  rise/fall  time,  50%  duty  cycle,  and  no 
overshoot/undershoot  as  shown  in  Fig.  5.4  (a).  In  this  ideal  case,  the  resultant 
frequency  spectrum  has  only  fundamental  and  odd  harmonic  components.  Unlike  an 
ideal  square-wave,  practical  square-waves  have  some  non-ideal  properties  as 
follows. 

Practical  square-waves  have  finite  rise  and  fall  times,  because  the  real 
transistors  implementing  square-wave  generators  have  finite  propagation  delay. 
Given  specific  transistors  and  their  constant  propagation  delay,  the  undesired  effects 
from  finite  rise  and  fall  times  will  be  worse  as  the  square-waves  have  shorter  periods. 
Fig.  5.4  (b)  shows  the  20ns-period  square-wave  having  Ins  rise  and  Ins  fall  time.  As 
shown  in  the  frequency  spectrum  of  Fig.  5.4  (b),  even  harmonics  appear,  which 
makes  post-FIR  filtering  much  more  complicated  to  remove  the  unnecessary 
harmonic  components.  As  the  signal  frequency  goes  higher,  this  non-ideal  even 
harmonics  associated  with  this  square-waves  will  seriously  increase. 

A 50%  duty  cycle  is  also  another  critical  factor,  characterizing  an  ideal 
square-wave.  If  a square-wave  is  generated  without  maintaining  50%  duty  cycle,  its 
frequency  spectrum  shows  even  harmonics  as  well  as  odd  harmonic  components.  Fig. 
5.4  (c)  and  (d)  show  the  time-domain  and  the  frequency-domain  behaviors  of  a 
square-wave  when  its  duty  cycle  is  60%  and  40%  with  zero  rise/fall  time, 
respectively.  Fig.  5.5  (a)  and  (b)  show  the  case  when  the  non-50%  duty  cycle  is 
combined  with  finite  rise/fall  time. 

Finally,  unstable  power  or  ground  lines  may  deteriorate  the  generated 
square-wave  quality,  because  they  can  cause  signal  fluctuation  in  the  duration  time. 
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Figure  5.4  Square-wave  behavior;  (a)  ideal  square-wave,  (b)  practical  square- 
wave  with  finite  rise  and  fall  time,  (c)  60%  duty  cycle  square-wave, 
and  (d)  40%  duty  cycle  square-wave. 
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Figure  5.5  Square-wave  behavior  with  (a)  60%  duty  cycle  and  finite  rise/fall 
time,  and  (b)  40%  duty  cycle  and  finite  rise/fall  time. 


The  signal  fluctuation  directly  adds  additional  frequency  components  to  the  ideal 
characteristics  in  the  frequency  spectrum.  By  inserting  the  well-designed  bypass 
capacitors  between  the  power  and  the  ground  lines,  however,  the  signal  fluctuation 
can  be  remarkably  reduced. 

Unlike  the  behavior  of  an  ideal  square-wave,  practical  square-waves 
generated  using  integrated  circuit  technologies  shows  some  unexpected  even 
harmonic  components  as  well  as  expected  odd  harmonic  components.  This  is  the 
reason  we  have  to  consider  eliminating  both  odd  and  even  harmonic  components  in 


the  FIR  filtering  stage. 
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5.3.2  FIR  Filter  Implementation 

In  the  section  5.2,  several  FIR  filtering  techniques  have  been  considered 
to  reduce  the  unnecessary  harmonics.  The  FIR  filtering  methods  shown  in  Fig.  5.2 
efficiently  work  to  suppress  the  unwanted  frequency  components,  only  when  the  FIR 
filter  is  implemented  with  ideal  filter  coefficients  summarized  in  Table  4. 1 . The  filter 
coefficients  can  be  represented  as  several  types  of  circuit  design  factors,  such  as 
impedance  or  current  entities.  The  weighted  summation  of  a FIR  filter  is  based  on  the 
various  coefficient  values,  and  its  operation  and  reliability  is  dependent  on  the 
implementation  accuracy  of  its  FIR  coefficients. 

Analog  circuit  design  requiring  precise  impedance  matching  or  accurate 
control  over  the  current  flowing  through  specific  devices  has  suffered  from  circuit 
parameter  variation  caused  by  process  variation  or  temperature  drift.  These 
unavoidable  circuit  parameter  deviations  change  the  FIR  filter  coefficients,  making 
them  deviate  from  the  nominal  values  of  the  original  design.  The  effect  of  this  FIR 
coefficient  variation  has  been  estimated  through  simulation. 

Suppose  an  FIR  filter  which  has  the  following  time-domain  representation 
y(k)  = d0  + d,x(k- 1)  + d2x(k- 2)  + ...  + d13x(k- 13)  (5.8) 

If  the  nominal  filter  coefficient  value  set  [do,d],d2,d3,d4,d5,d6,d7,d8,d9, 
d10,d11,d12,d13]  = [d0,d1,d2,d3,d4,d5,d6,d6,d5,d4,d3,d2,d1,d0]  displays  the  frequency 
response  shown  in  case  (a)  of  Fig.  5.6,  then  the  zeros  of  its  transfer  function  are 
located  on  the  same  position  as  the  undesired  harmonic  components.  Therefore,  the 
filter  coefficients  drastically  reduce  the  magnitude  of  the  unwanted  components. 
However,  inevitable  process  variation  or  unpredictable  environmental  changes  can 
alter  the  circuit  parameters,  which  can  also  warp  the  FIR  filter  response.  Fig.  5.6 
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shows  some  catastrophic  cases  with  10%  filter  coefficient  variation.  In  case  (b)  and 
(d),  the  filter  response  is  no  longer  what  it  was,  and  case  (c)  loses  one  of  the  zeros  it 
should  have.  The  lost  zero  results  in  the  resurrection  of  the  unwanted  harmonic 
components  located  on  the  missing  zero  position. 

Fig.  5.7  shows  more  deviant  cases,  in  which  case  (b)  and  case  (c)  show  the 
results  when  the  first  6 largest  and  smallest  coefficients  are  increased  by  10%, 
respectively.  The  frequency  responses  from  case  (b)  and  case  (c)  can  not  be  used  for 
harmonic  component  cancellation,  because  it  has  no  zero  at  the  location  where  it 
should  be.  It  should  be  noted  that  case  (d)  and  case  (e)  follows  very  closely  to  the 
nominal  frequency  response  of  case  (a),  except  a slight  gain  difference. 

Fortunately,  from  Fig.  5.6  and  Fig.  5.7,  we  know  that  the  partly-changed 
FIR  coefficient  set  affects  harmonic  cancellation  more  seriously.  (The  responses 
shown  in  the  case  (d)  and  (e)  of  Fig.  5.7  are  more  efficient  in  harmonic  cancellation 
than  other  cases  shown  in  Fig.  5.6  and  Fig.  5.7.)  These  locally  variable  design 
parameters  may  be  reduced  by  using  the  common-centroid  layout  techniques 
[HasOl], 

With  the  maximum  10%  variation  of  coefficient  values,  the  worst 
combination  of  FIR  coefficients  was  identified  using  MATLAB.  To  figure  out  the 
worst  variation  of  FIR  coefficients,  MATLAB  finds  the  coefficient  combination 
searching  for  the  smallest  gain  difference  between  the  fundamental  and  the  dominant 
harmonic  components. 

The  whole  system  is  simulated  with  the  identified  coefficient  combination 
using  the  Cadence  SpectreS.  The  frequency  spectrum  result  of  the  FIR  filter  designed 
in  Fig.  5.2  (a)  is  shown  in  Fig.  5.8  (a).  The  upper  and  lower  graphs  represent  the  DFT 
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Figure  5.6  FIR  filter  coefficient  variation  effects; 
case  (a):  nominal  coefficient  value  set 

[do,dj,d2,d3,d4,d5,d6,d6,d5,d4,  d3,d2,dj,d0] 
case  (b):  half  plane  variation  by  +10% 

[d0,d ! ,d2,d3,d4,d5,d6, 1 . 1 d6, 1 . 1 d5, 1 . 1 d4, 1 - 1 d3, 1 . 1 d2, 1 . 1 d , , 1 . 1 d0] 
case  (c):  zigzag  variation  by  +10% 

[d0, 1 . 1 d,,d2, 1 • 1 d3,d4, 1 . 1 d5,d6, 1 . 1 d6,d5, 1 • 1 d4,d3, 1 . 1 d2,d  b 1 . 1 d0] 
case  (d):  symmetric  zigzag  variation  by  +10% 

[do,  1 . 1 d ] ,d2, 1 • 1 d2,d4, 1 . 1 d5,d6,do,  1 . 1 d5,d4, 1 . 1 d2,d2, 1 . 1 d j ,dg] 
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Figure  5.7  FIR  filter  coefficient  variation  effects; 
case  (a):  nominal  coefficient  value  set 

[d0,d i ,d2>d3>d4,d5,d6>d645>d4>  d3,d2>d bd0] 

case  (b):  first  6 largest  coefficient  value  variation  by  +10% 

[ 1 . 1 do,  1 . 1 d i , 1 . 1 d2,d3,d4>d5,d6>d6,d5,d4,  d3, 1 . 1 d2, 1 . 1 d i , 1 . 1 dg] 

case  (c):  first  6 smallest  coefficient  value  variation  by  +10% 

[d0,di ,d2,d3, 1 . 1 d4, 1 . 1 d5, 1 . 1 d6, 1 . 1 d6, 1 . 1 d5, 1 . 1 d4,  d3,d24i ,d0] 
case  (d):  nominal  coefficient  value  variation  by  +30% 

[1.3do,1.3d1,1.3d2,1.3d3,1.3d4,1.3d5,1.3d6,1.3d6,1.3d5,1.3d4,1.3d3,1.3d2 

,1.3db1.3d0] 

case  (e):  nominal  coefficient  value  variation  by  -30% 

[0.7d0,0.7d1,0.7d2,0.7d3,0.7d4,0.7d5,0.7d6,0.7d6,0.7d5,0.7d4,0.7d3,0.7d2 

,0.7d,,0.7d0] 
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dB20(dft(VT("/asqout")  3.76e-08  2.456e-07  512  "Rectangular"  1 1)) 


(a) 


(b) 

Figure  5.8  Circuit  simulation  results  showing  coefficient  variation  effects  of  FIR 
filters  (a)  shown  in  Fig.  5.2  (a);  nominal  value  of  coefficients  (top) 
and  worst  combination  with  ± 10%  coefficient  variation  (down),  and 
(b)  shown  in  Fig.  5.2  (c);  nominal  value  of  coefficients  (top)  and 
worst  combination  with  ±10%  coefficient  variation  (down) 
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results  of  generated  sources  using  nominal  value  coefficients  and  worst-case 
coefficients,  respectively.  The  generated  signal  quality  with  10%  error  in  the  FIR 
coefficients  is  about  lOdB  below  the  nominal  value  results.  This  is  because  the 
nonideal  FIR  coefficients  move  the  location  of  system  zeros  away  from  the  originally 
designed  position  of  the  unwanted  harmonic  component  to  be  eliminated,  and  then 
can  not  cancel  the  harmonics  out  effectively.  The  same  simulation  has  been  done 
with  the  FIR  filter  shown  in  Fig.  5.2  (c),  and  the  frequency-domain  result  is  shown 
in  Fig.  5.8  (b).  The  signal  quality  loss  due  to  10%  variation  of  FIR  coefficients  is 
about  lOdB,  as  in  Fig.  5.8  (a). 

5.3.3  Post-Low-Pass  Filtering  Issues 

A low-pass  filter  attenuates  the  frequency-mirroring  components,  which 
can  not  be  cancelled  out  by  FIR  filtering  techniques.  As  shown  in  the  ideal  case  of 
Fig.  5.3,  these  mirrored  components  at  (m  x fs)  ± fj , (where  m is  integer  and  greater 
than  zero,  fs  is  a system  clock  frequency,  and  fj  is  a desired  fundamental  frequency) 
become  the  dominant  harmonics,  assuming  the  intermediate  harmonics  are  nullified 
by  an  FIR  filter.  Since  signal  quality  created  using  the  LFSR  approach  depends  on 
the  magnitude  of  the  initial  uncanceled  mirrored  harmonic  component,  it  is  necessary 
to  have  a low-pass  filtering  process  in  the  final  stage. 

One  advantage  with  the  LFSR  approach  using  an  FIR  filter  is  that  the 
filtering  specification  can  be  much  relaxed,  because  the  harmonic  components  near 
the  fundamental  frequency  are  already  eliminated  or  effectively  degraded  by  FIR 
filters.  Unlike  the  memory-based  approach  mentioned  in  Chapter  3 and  Chapter  4, 
the  LFSR  approach  does  not  require  the  tunable  post-filtering,  which  gives  the 
designer  a lot  of  options  in  designing  the  post-filter  stage. 
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5.4  Overall  Structure 

5.4,1  Single-Tone  Generation 

The  overall  structure  for  generating  a single-tone  source  is  shown  in  Fig. 
5.9.  It  has  3 major  blocks,  which  are  a LFSR  block  to  generate  a square-wave,  a 
weighted  summation  block  to  execute  the  FIR  filtering  operation,  and  a low-pass 
filter.  The  LFSR  is  configured  with  a feedback  connection  from  the  inverted  output 
of  the  last  cell  to  the  input  of  the  first  cell,  as  shown  in  Fig.  5.9.  The  signals  tapped 
in  the  middle  of  the  LFSR,  such  as  x(k  - 1 ),  x(k  - 2)  and  x(k  - n) , represent  the 
delayed  inputs,  and  x(k)  represents  the  present  input  value.  These  input  values  should 
be  scaled  by  the  amount  corresponding  to  the  FIR  filter  coefficients,  and  then  added 
together  to  get  the  desired  signal  without  the  unwanted  harmonics. 

The  weighted  summation  block  introduces  weighted  factors  for  each  of 
tapped  signals  from  the  LFSR,  and  then  scales  and  adds  them  in  1-bit  continuous 
level,  not  a multi-bit  digital  level.  To  execute  addition  with  a 1-bit  wide  structure, 
“current”  magnitude  or  impedance  can  be  considered  as  arithmetic  variables 
representing  weighted  factors.  The  digital  signals  x(k),  x(k-l),...,  x(k-n),  are 
converted  and  scaled  to  “current”  magnitudes  for  further  addition  operation. 

To  generate  scaled  current  magnitudes  from  digital  signals,  passive 
elements  (such  as  resistors)  or  active  devices  (such  as  transistors)  can  be  used.  For 
example,  an  MOS  transistor  can  be  used  as  a voltage-to-current  converter  with  a 
input  applied  to  its  gate  node,  generating  a current  at  the  drain  node.  Since  the 
amount  of  drain  current  can  be  scaled  by  the  transistor  size,  the  various  weighting 
factors  may  be  implemented  using  MOS  transistors  with  various  W/L  ratio’s. 
However,  MOS  transistors  are  vulnerable  to  substrate  noise  originating  from  noisy 
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Figure  5.9  Block  diagram  of  LFSR  approach  for  single-tone  source  generation 
using  FIR  filtering 

digital  blocks,  and  the  signal  quality  generated  using  active  transistors  is  limited  by 
substrate  crosstalk.  Also,  MOS  transistors  incorporate  with  several  parasitic  junction 
capacitors,  such  as  gate-to-source  or  gate-to-drain  capacitance,  causing  coupling 
problems  between  neighboring  nodes. 

Another  method  to  implement  weighted  summation  is  using  passive 
resistors.  The  resistor  can  be  fabricated  on  2-dimensional  plane  with  less  3- 
dimensional  interference  from  substrate  fluctuation.  Although  many  passive 
elements  suffer  from  fabrication  variation  and  temperature  effects,  the  resistor  may 
be  a good  choice,  because  only  relative  (localized)  manufacturing  differences 
between  nearby  resistors  on  the  same  IC  will  damage  the  output  quality.  In  other 
words,  as  seen  in  Fig.  5.7,  the  overall  deviation  of  IC  resistance  value,  which  is 
representing  the  FIR  coefficients,  from  its  nominal  value  does  not  spoil  the  generated 
signal  output.  With  carefully  drawn  common-centroid  layout  of  passive  resistors,  this 
method  may  implement  the  LFSR  approach  easier  than  using  active  devices.  These 
resistance  values  are  determined  from  the  filter  coefficients  calculated  in  the  section 
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5.2.  The  resistance  values  can  be  extracted  from  the  FIR  filter  coefficients  by  taking 
their  reciprocals,  because  the  current  represents  the  coefficient  ratio  in  a current 
summing  circuit. 

Given  a variable  clock  source  to  control  the  shifting  operation  of  the 
shifter  in  Fig.  5.9,  the  fundamental  frequency  (f])  can  be  variable  with  proper  post- 
filtering. Since  most  of  the  unwanted  harmonic  components  are  eliminated  by  FIR 
filtering,  the  low-pass  filter  following  the  weighted  summation  block  does  not  have 
to  be  tunable  in  cutoff  frequency.  The  suppressed  intermediate  harmonic  components 
enable  a simple  constant  frequency  RC  low-pass  filter  to  effectively  reduce  the 
fundamental-mirrored  harmonic  components.  Fig.  5.10  shows  an  LFSR  combined 
with  weighted  summation  resistor  array  and  a post-RC  filter.  Given  the  FIR 
coefficient  set  [dj,  d2,  d3,...],  the  corresponding  resistance  value  set  should  be  [m/dj, 
m/d2,  m/d3,...],  where  m is  an  arbitrary  scale  factor.  From  Fig.  5.10,  we  know  that 

— = R,  +2R  (5.9) 

di 

— = R2  + 2R  (5.10) 

d2 


(R,+2R)  R, 
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(R3+2R)  R3 
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-I  > out 
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R « R,,  R2,  R3,  ... 


Figure  5.10  An  LFSR  combined  with  weighted  summation  resistor  array  and  a 
post-RC  filter 
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and  so  on.  With  fixed  resistance  R,  resistor  array  [Rj,  R2,  R3,...]  can  be  determined 
using  the  above  relationship. 

5.4.2  Multi-Tone  Generation 

As  seen  in  Fig.  5.1,  a square-wave  pulse  has  one  dominant  frequency 
component  (fj),  and  other  components  decrease  in  magnitude  as  the  frequency  goes 
up.  Therefore,  only  a single-tone  signal  could  be  generated  with  this  scheme.  In  order 
to  generate  multi-tone  signals,  multiple  shift  chains  must  be  built,  as  in  the  dual-tone 
generation  case  of  Fig.  5.11.  For  the  example  of  dual-tone  signal  generation,  both 
LFSR1  and  LFSR2  have  a different  shift  chain  lengths,  as  shown  in  Fig.  5.11, 
because  they  are  being  synchronized  with  the  same  clock.  Therefore,  the  dual-tone 
output  of  weighted  summation  y(k)  can  be  expressed 

y(k)  = {Xj(k)  + v1x1(k-l)  + v2x,(k-  2)  + ...  + vnx,(k-  m)}  (5.11) 

+ |x2(k)  + WjX2(k-  1)  + w2x2(k  - 2)  + ...  + wnx2(k  - n) } 


Figure  5.1 1 Example  of  dual-tone  generation  using  LFSR  approach.  Both  LFSR1 
and  LFSR2  must  have  chains  of  different  lengths. 
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Fig.  5.12  shows  an  example  of  harmonic-based  dual-tone  generation 
scheme  using  two  different  system  clocks.  Since  both  two  tones  are  harmonically 
related,  a secondary  clock  can  be  generated  by  using  the  clock  frequency  divider,  as 
shown  in  Fig.  5.12.  And,  the  length  of  both  LFSRs  are  identical,  because  the 
fundamental  frequencies  of  both  LFSRs  are  determined  by  system  clock  difference, 
not  by  shift  chain  length. 


Figure  5.12  Example  of  harmonic-based  dual-tone  generation  using  LFSR 
approach.  Both  LFSR1  and  LFSR2  must  be  identical. 

5.5  IC  Implementation  And  Measurement 
5.5.1  IC  Implementation  And  Simulation 

The  signal  source  generation  method  using  LFSR  approach  has  been 
designed  as  shown  in  Fig.  5.13,  which  includes  a variable  clock  generator,  two 
single-tone  source  generation  blocks,  a dual-tone  source  generation  block,  and  an 
internal  clock  monitoring  down-counter.  Each  individual  sub-block  for  signal 
generation  (“block  (a)”,  “block  (b)”,  “block  (c)”  in  the  schematic  of  Fig.  5.13) 
consists  of  a square-wave  generator,  an  FIR  filtering  circuit  block,  and  a low-pass 
filter.  A square-wave  is  generated  using  an  LFSR,  and  its  chain  length  is  varied 
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depending  on  the  FIR  filtering  method.  The  transfer  function  shown  in  Fig.  5.2  (a)  is 
chosen  to  implement  the  FIR  filtering  circuit  of  block  (a)  in  Fig.  5.13,  which 
eliminates  odd  harmonic  components  only  (FIRO).  The  other  single-tone  signal 
generation  block  (b)  uses  the  FIR  filtering  shown  in  Fig.  5.2  (c),  which  removes  both 
even  and  odd  harmonic  components  to  make  better  signal  quality  at  the  expense  of  a 
two-times  longer  LFSR  chain  length  (FIR1).  In  block  (c),  a dual-tone  signal  source 
is  generated  using  two  different  LFSR  chains  and  two  independent  FIR  filtering 
circuits. 

The  FIR  filtering  circuits  are  implemented  using  passive  poly  resistor 
arrays,  because  it  is  easier  to  mix  up  with  other  digital  circuits  for  the  following 
reasons. 


Figure  5.13  Top-level  schematic  (Rev. 3.0)  of  signal  source  generator  using  LFSR 
approach.  It  includes  a variable  clock  generator,  2 single-tone  source 
generators  (block  (a)  and  (b)),  and  a dual-tone  source  generator  (block 

(c)). 
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From  the  manufacturing  viewpoints,  passive  poly  resistors  are  2- 
dimensional  circuit  elements  while  other  active  devices,  such  as  MOS  transistors,  are 
3-dimensional  entities.  In  other  words,  a poly  resistor  is  a 2-terminal  element,  which 
is  independently  fabricated  off  the  substrate.  Since  the  bulk  node  of  MOS  transistors 
is  defined  with  substrate  node,  however,  it  is  sensitive  to  substrate  voltage 
fluctuation.  Considering  the  unstable  substrate  voltage  in  many  mixed-signal 
circuits,  it  is  not  easy  to  implement  sophisticated  function  blocks  with  substrate- 
dependent  devices. 

Secondly,  active  transistor  circuits  have  limited  input  voltage  swing  range 
to  ensure  every  transistor  is  operating  in  saturation  mode.  In  order  to  scale  a rail-to- 
rail  swinging  digital  signal  down  to  this  limited  range,  additional  circuits  are 
required.  And,  higher  operating  frequency  may  cause  extra  undesirable  parasitic 
effects  within  the  device  itself  and  coupling  problems  between  neighboring  devices. 

The  Cadence  SpectreS  simulation  results  for  the  designed  signal 
generation  circuits  in  Fig.  5.13  are  shown  in  Fig.  5.14.  The  top  and  bottom  figures  in 
Fig.  5.14  (a)  show  the  frequency  spectrums  of  single-tone  signal  sources  generated 
by  block  (a)  and  block  (b)  shown  in  Fig.  5.13,  respectively.  As  described  in  Fig.  5.2 
(a),  the  signal  output  (“sigoutl”)  of  block  (a)  in  Fig.  5.13  should  have  odd  harmonic 
components  as  well  as  fundamental  and  its  mirrored  fundamental  components.  As 
shown  in  the  simulation  result  of  Fig.  5.14  (a),  the  signal  “sigoutl”  has  a 2nd 
harmonic  component  which  is  larger  than  the  mirrored  fundamental  component  at 
(fs  — f j ) , where  fs  is  the  system  clock  frequency.  This  dominant  2nd  harmonic 
frequency  deteriorates  the  generated  signal  quality,  SFDR. 
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Figure  5.14  Cadence  SpectreS  simulation  results  of  (a)  single-tone  signal  sources 
and  (b)  a dual-tone  signal  source.  The  2nd  harmonic  component  in  the 
top  of  (a)  is  the  dominant  one,  while  the  mirrored  fundamental  at 
(fs  - fj)  is  the  biggest  harmonic  in  the  bottom  of  (a). 
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By  using  the  FIR  filtering  explained  in  Fig.  5.2  (c),  these  unwanted  odd 
harmonic  components  can  be  suppressed,  as  shown  in  the  bottom  of  Fig.  5.14  (a), 
where  the  2nd  harmonic  frequency  is  being  controlled  to  be  less  than  the  mirrored 
fundamental  frequency  at  (fs-fj).  In  this  case,  the  SFDR  is  determined  by  the 
unavoidable  component  at  (fs  - fj) , not  by  the  2nd  harmonic  component. 

The  single-tone  fundamental  frequency  (fj ) can  be  scaled  by  changing  the 
system  clock  frequency  (fs)  by  the  following  relation, 


2 x (LFSR  length  in  block  (a)) 


(5.12) 


f, 

(LFSR  length  in  block  (b)) 


(5.13) 


The  simulated  result  of  generated  dual-tone  signal  (“sigout3”)  from  the 
block  (c)  in  Fig.  5.13  is  shown  in  Fig.  5.14  (b).  The  dual-tone  generation  circuit  is 
designed  as  shown  in  Fig.  5.11,  where  2 different  LFSRs  share  the  unique  system 
clock.  Therefore,  the  dual  fundamental  tones,  fdual,  l and  fdual,2>  are  determined  from 
the  LFSR  chain  length,  which  are 


1dual,l 


2 x (LFSR1  length) 


(5.14) 


‘dual, 2 


2 x (LFSR2  length) 


(5.15) 


By  taking  different  length  for  LFSRs  or  changing  the  system  clock  frequency,  the 
dual-tone  signal  can  also  be  adjusted.  The  full  chip  layout  floorplan  is  shown  in  Fig. 
5.15,  where  the  single-tone  generators  (FIRO)  and  (FIR1)  are  representing  the  block 
(a)  and  (b)  of  the  schematic  in  Fig.  5.13,  respectively. 
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Figure  5.15  Full  chip  layout  floorplan  for  signal  source  generators  (Rev. 3.0)  with 
three  major  blocks,  including  two  single-tone  source  generation 
blocks  and  one  dual-tone  generation  block.  These  are  integrated 
together  using  0.25pm  IBM  6HP  BiCMOS  process.  The  single-tone 
generator  (FIRO)  and  (FIR1)  are  implemented  using  FIRs  explained  in 
Fig.  5.2  (a)  and  (c),  respectively.  Two  different  independent  single- 
tone generation  circuits  are  combined  to  make  the  dual-tone  generator 
shown  above,  sharing  the  system  clock  and  the  low-pass  filter.  The 
total  chip  area  including  I/O  pads  is  (806pm  x 863pm) . 

5.5.2  IC  Measurement  And  Analysis 

The  fabricated  wafer  samples  have  been  measured  in  two  different  ways, 
one  is  measuring  the  bare  wafer  samples  using  a probestation,  and  the  other  is 
measuring  the  package  samples  using  a test  board.  Fig.  5.16  shows  the  measurement 
environment  including  the  test  board  designed  for  testing  the  fabricated  signal 
generator  (Rev. 3.0).  The  measured  signal  output  is  captured  in  the  time-domain  by 
the  digital  oscilloscope  and  transformed  to  the  frequency-domain.  The  resultant 
frequency  spectrum  of  single-tone  generation  using  FIR  filtering  shown  in  Fig.  5.2 
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Figure  5.16  Illustration  of  the  package  sample  measurement  setup  for  the  signal 
generator  (Rev. 3.0)  using  the  proposed  LFSR  approach  with  FIR 
filtering 

(a)  is  shown  in  Fig.  5.17.  The  bare  wafer  samples  have  been  measured  by  directly 
probing  the  I/O  pads  on  the  ICs  using  a probestation  without  the  aid  of  bypass 
capacitors.  The  measured  result  in  the  frequency-domain  is  shown  in  Fig.  5.18.  The 
signal  frequency  of  the  generated  single-tone  source  is  adjustable  by  changing  the 
internal  clock  speed  as  mentioned  in  Chapter  3. 

The  generated  fundamental  frequency  of  single-tone  sources  is  shown  in 
Fig.  5.19.  The  on-chip  ring  oscillator  having  voltage-controlled  load  capacitors 
generate  the  internal  system  clock,  which  gets  slower  as  the  control  voltage 
increases.  Since  the  variable  clock  is  related  to  the  fundamental  frequency  by  Eq. 
(4.12)  or  (4.13),  the  generated  signal  frequency  also  decreases  with  the  increasing 
control  voltage  “Vcnt”.  The  measured  signal  quality  (SFDR)  of  both  package  samples 
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Figure  5.17  Measured  frequency  spectrums  of  the  single-tone  signal  generator 
using  LFSR  (FIRO)  approach  (Rev. 3.0);  when  the  control  voltage  Vcnt 
is  (a)  1.0V,  (b)  1.5V,  (c)  2.0V,  (d)  2.5V,  (e)  3.0V,  (f)  3.5V,  and  (g) 
4.0V.  Measurement  was  done  using  package  samples  loaded  onto  a 
test  board  with  external  bypass  capacitors.  The  measured  SFDRs  are 
(a)  40.6dB,  (b)  40.0dB,  (c)  38.1dB,  (d)  38.8dB,  (e)  37.2dB,  (f) 
41.3dB,  and  (g)  44.4dB. 
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Figure  5.17  Continued 
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Figure  5.18  Measured  frequency  spectrums  of  the  single-tone  signal  generator 
using  LFSR  (FIR1)  approach  (Rev. 3.0);  when  the  control  voltage  Vcnt 
is  (a)  2.0V,  (b)  2.2V,  (c)  2.5V,  (d)  2.8V,  (e)  3.0V,  (f)  3.3V,  and  (g) 
3.5V.  Measurement  was  done  using  wafer  samples  loaded  onto  a 
probestation  without  bypass  capacitors.  The  measured  SFDRs  are  (a) 
24.4dB,  (b)  28.1dB,  (c)  35.0dB,  (d)  32.5dB,  (e)  27.8dB,  (f)  33.1dB, 
and  (g)  30.6dB. 
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Figure  5.18  Continued. 
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Figure  5.19  Measured  single-tone  frequency  of  the  signal  generator  using  LFSR 
approach  (Rev. 3.0).  As  the  control  voltage  (Vcnt)  increases,  the 
internal  clock  becomes  slower,  making  generated  the  signal 
frequency  decrease. 

and  wafer  samples  is  summarized  and  compared  in  Fig.  5.20.  As  mentioned  in  the 
section  5.4.1,  two  different  types  of  FIR  filtering  (FIRO  and  FIR1)  have  been 
fabricated  and  measured.  Since  FIRO  filtering  (as  seen  in  Fig.  5.2  (a))  can  not 
eliminate  the  even  harmonic  components  while  FIR1  filtering  (as  seen  in  Fig.  5.2  (c)) 
can  filter  out  all  the  unnecessary  harmonics,  the  signal  generated  through  the  FIR1 
filtering  is  expected  to  have  better  SFDR  than  the  signal  output  from  the  FIRO 
filtering  block.  From  the  circuit  simulation  results  shown  in  Fig.  5.14  (a),  we  know 
that  the  FIRl-type  signal  generator  makes  signal  sources  with  roughly  lOdB  higher 
SFDR  than  the  FIRO-type  circuit  does. 

As  seen  in  Fig.  5.20,  however,  the  board  measurement  results  of  the  FIRO- 
type  single-tone  signal  generator  shows  better  SFDR  than  the  FIRl-type  of  design. 
This  may  be  explained  as  follows. 
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Figure  5.20  Measurement  results  (SFDR)  of  signal  generators  using  LFSR 
approach  (Rev. 3.0);  The  board  test  comes  with  external  bypass 
capacitors  to  remove  the  high  frequency  noise  present  in  the  DC  and 
power  supply  lines,  but  the  probe  test  does  not  have  any  bypassing 
scheme. 

1)  The  FIRO-type  design  requires  fewer  filter  coefficients  than  the  FIR1- 
type  does,  as  shown  in  the  Table  5.1.  Therefore,  the  FIRO-type  signal  generator  can 
be  implemented  with  more  compact  and  simpler  hardware,  reducing  the  probability 
of  manufacturing  error  in  the  resistor  values. 

2)  Depending  on  the  value  range  of  FIR  coefficients,  the  FIR  filter 
becomes  more  or  less  vulnerable  to  device  mismatch  effects,  i.e.  a larger  coefficient 
value  range  will  result  in  more  probable  coefficient  deviation  from  the  nominal 
value. 


3)  The  FIR  1 -type  signal  generator  has  two-times  longer  shift  chain  length 
than  the  FIRO-type.  It  requires  larger  digital  circuit  blocks,  causing  more  critical 
clock  skew  problem.  The  distortion  of  the  square-wave  generated  by  the  shift  chain 
circuit  has  direct  impact  on  signal  generation,  as  mentioned  in  the  section  5.3.1. 

Another  error  source  can  be  explained  from  the  unmatched  output 
resistance  of  each  register  cell  of  a LFSR  chain.  The  circuit  shown  in  Fig.  5.10  can 
be  redrawn  in  Fig.  5.21.  The  “on”  resistance  (Routp  and  Routn)  is  approximated  as 
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Figure  5.21  Redrawn  LFSR  output  stage  combined  with  weighted  summation 
resistor  array  and  a post-RC  filter. 
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(5.16) 


where  Vdd  is  supply  voltage,  X is  channel  length  modulation,  and  is  transistor 
saturation  current,  which  is  described  as 


t n w 

W = B x Cox  x Y x 


(^dd  _ VT)  x Vdsat 


(Vdsat) 


2\ 


(5.17) 


where  p is  carrier  mobility,  Cox  is  oxide  capacitance  per  unit  area,  W is  gate  width, 
L is  gate  length,  VT  is  transistor  threshold  voltage,  and  Vdsat  is  drain  saturation 
voltage  [Rab03],  By  taking  the  saturation  current  value  (Idsat)  from  *he  circuR 
simulation  and  the  channel  length  modulation  (A.)  from  the  manual  model  parameters 
of  generic  0.25  pm  CMOS  process,  the  transistor  “on”  resistance  (Routp  f°r  PMOS 
and  Routn  for  NMOS)  is  calculated  using  Eq.  (5.16)  as  follows. 
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r»«p = = 334n  <5,8) 

= !xSj,<(1-r0'06(V"')>‘3-3(V)) = 498n  (519> 

(In  the  above  calculation,  the  channel  length  modulation  is  an  approximated  value, 
because  BICMOS  technology,  not  pure  CMOS  technology,  was  used  for  this 
implementation,  and  its  minimum  gate  length  is  larger  than  0.25  pm.)  This 
unmatched  resistance  value  adds  offset  to  the  original  resistor  array.  And,  from  the 
implementation  shown  in  Fig.  5.21,  the  resistance  RA  is  approximated  (Rj+2R). 
However,  the  absolute  value  of  RA  is  calculated  as 

Ra  = R1+(2R||Rb)  (5.20) 

even  though  RB  is  much  larger  than  (2R).  Furthermore,  the  resistance  RB  varies, 
depending  on  the  position  of  the  concerned  RA. 

Fig.  5.22  shows  the  generated  signal  source  quality  in  the  frequency 
domain  when  the  FIR  coefficients  are  changed  with  ±20%  variation.  It  should  be 
noted  that  in  this  case,  the  FIRO  signal  quality  is  slightly  better  than  the  FIR1  one. 
This  situation  can  explain  the  measurement  results  in  which  the  FIRO  results  are 
better  than  FIR1.  From  this  simulation  and  measurement  results,  we  know  that  the 
FIR1  design  is  more  sensitive  to  the  FIR  coefficient  variation. 

The  dual-tone  source  generation  circuit  is  also  measured  using  both 
package  samples  and  bare  wafer  samples.  Fig.  5.23  shows  the  measured  results  from 
the  package  samples,  displaying  the  time-domain  and  frequency-domain  response  of 
the  generated  dual-tone  signal.  Both  results  show  about  42dB  and  37dB  SFDRs,  but 
the  package  samples  do  not  have  wide  voltage  range  for  tuning  the  frequency.  The 
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Figure  5.22  Circuit  simulation  results  showing  20%  coefficient  variation  effects 
of  FIR.  The  FIR1  shows  34dB  SFDR  (top),  and  the  FIRO  shows  35dB 
SFDR  (down). 

generated  dual-tone  sources  are  severely  distorted  in  the  range  of  less  than  3 V control 
voltage. 

The  measured  dual-tone  frequency  spectrum  using  bare  wafer  samples  is 
shown  in  Fig.  5.24.  A couple  of  design  issues  are  identified  from  the  measurement 
results.  First,  two  fundamental  components  are  not  of  the  same  magnitude.  This 
unequal  dual-tone  signal  results  for  the  following  reasons; 

1)  different  FIR  filter  gain  at  the  frequency  locations  of  the  dual  tones 

2)  different  low  pass  filter  gain  within  the  band  of  interest. 

For  applications  requiring  precise  magnitude  control,  this  uneven  magnitude  dual- 
tone source  could  be  a problem. 

Second,  undesired  noise  components  are  present  within  the  low  frequency 
band  of  interest,  mostly  which  is  considered  as  an  outcome  of  intermodulation  of 
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Figure  5.23  Measurement  results  of  the  dual-tone  signal  generator  using  LFSR 
(FIRO)  approach  (Rev. 3.0).  (a)  captured  time-domain  waveform  and 
(b)  its  frequency  spectrum  measured  from  sample  #1  when  the  control 
voltage  Vcnt=3.9V,  and  (c)  captured  time-domain  waveform  and  (d) 
its  frequency  spectrum  measured  from  sample  #2  when  the  control 
voltage  Vcnt=4V.  The  package  samples  are  used  for  measurement 
under  the  environment  shown  in  Fig.  5.16.  The  measured  SFDRs  are 
(b)  42.2dB  and  (d)  36.6dB. 
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Figure  5.24  Measured  frequency  spectrums  of  wafer  samples  of  the  dual-tone 
signal  generator  using  LFSR  (FIRO)  approach  (Rev. 3.0);  when  the 
control  voltage  Vcnt  is  (a)  1.5V,  (b)  1 .8V,  (c)  2.0V,  (d)  2.3V,  (e)  2.5V, 
(f)  2.8V,  (g)  3.0V,  and  (h)  3.3V.  Measurement  was  done  using  wafer 
samples  loaded  onto  a probestation  without  bypass  capacitors.  The 
measured  SFDRs  are  (a)  24.7dB,  (b)  24.7dB,  (c)  25.0dB,  (d)  29.4dB, 
(e)  30.9dB,  (f)  26.9dB,  (g)  25.6dB,  and  (h)  29.1dB. 
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dual  fundamental  tones.  In  worst  case,  these  intermodulated  baseband  noise  tones  are 
more  critical  than  the  dominant  harmonic  component,  as  in  the  case  (d)  of  Fig.  5.24. 
To  avoid  this  intermodulated  distortion,  efficient  isolation  schemes  for  individual 
LFSR  chains  and  their  processing  blocks  before  weighted  summation  are  required. 
The  measured  SFDR  of  a wafer  sample  dual-tone  signal  generator  is  shown  in  Fig. 
5.25. 


► dual-tone  signal  probe  test 


Figure  5.25  Measured  SFDR  of  the  dual-tone  signal  generator  using  LFSR  (FIRO) 
approach  (Rev. 3.0) 


5.6  Summary 

In  this  chapter,  the  circuit  design  and  implementation  of  the  signal 
generator  using  LFSR  chain  registers  and  FIR  filtering  was  described.  By 
simplifying  the  digital  circuit  blocks  and  by  using  the  passive  noise-shaping  filter, 
this  method  helps  alleviate  the  design  difficulties  of  the  signal  generator  using  the 
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memory-based  periodic  bitstream  method  discussed  in  Chapter  3 and  Chapter  4.  The 
passive  filter  comes  with  more  reliable  substrate  noise  immunity  and  eliminates  the 
necessity  of  digital  signal  scaling  to  interface  the  analog  circuit.  The  single-tone 
measurement  showed  about  40dB  SFDR  and  30dB  SFDR  of  FIRO  and  FIR1  design, 
respectively.  Unlike  the  simulation  results,  FIR1  design  generates  poorer  quality 
signal  sources  than  FIRO  with  the  following  reasons. 

(1)  The  FIR1  design  has  more  filter  coefficients  than  the  FIRO. 

(2)  The  FIR1  design  has  larger  coefficient  value  range. 

(3)  The  FIR1  design  has  two-times  bigger  digital  circuit  blocks. 

And,  the  circuit  simulation  with  ±20%  FIR  coefficient  variation  shows 
that  the  signal  source  generated  by  FIR1  design  could  be  worse  than  the  FIRO.  From 
the  simulation  and  measurement  results,  FIR1  is  more  sensitive  to  FIR  coefficient 
deviation. 

Because  the  FIR1  design  has  more  efficient  filtering  scheme  for 
suppressing  the  unwanted  harmonic  components,  however,  its  signal  quality  could  be 
improved  by  1)  more  reliable  realization  of  weighted  summation  circuit  for  FIR 
filtering,  and  2)  more  balanced  design  of  system  clock  and  its  distribution  tree.  In  the 
case  of  dual-tone  signal  source  generation,  the  measurement  results  show  25  to  30dB 
SFDR  constantly,  even  though  the  best  working  sample  shows  up  to  40dB  SFDR. 


CHAPTER  6 

SUMMARY  AND  SUGGESTIONS  FOR  FUTURE  WORK 

With  traditional  rack-and-stack  measurement  equipment,  analog  circuit 
testing  cost  of  many  mixed-signal  designs  is  becoming  a major  portion  of  the 
manufacturing  cost.  In  order  to  alleviate  the  testing  overhead  in  developing  the 
mixed-signal  or  analog  circuit  ICs,  a more  efficient  testing  methodology  is  required, 
implying  frequency-programmable  embedded  signal  sources  for  driving  target 
DUTs. 

In  Chapter  1,  signal  source  requirements  for  embedded  self-test  were 
described,  and  the  research  goals  of  this  dissertation  were  presented.  And,  existing 
signal  generation  methods  were  reviewed  from  the  embedded  test  standpoint, 
including  ring  oscillators,  LC  oscillators,  relaxation  oscillators,  DDFS,  and  the  delta- 
sigma  periodic  bitstream  method. 

6.1  Substrate  Characterization  Using  Embedded  Signal  Source 

The  designed  signal  source  generation  circuit  can  be  integrated  on-chip 
with  main  circuits  (analog  or  mixed-signal),  to  build  up  the  embedded  self-testing 
architecture.  This  design  integration  should  be  carefully  done  in  1)  separating  main 
chip  circuits  from  the  additional  self-testing  circuits,  and  vice  versa,  and  2)  recycling 
the  existing  main  chip  circuits  for  the  use  of  self-testing  circuit  to  save  the  testing 
hardware. 

More  specifically,  substrate  noise  characterization  scheme  using 
embedded  signal  sources  was  proposed  in  Chapter  2.  The  conventional  approaches 
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for  characterizing  the  substrate  behavior,  which  are  trying  to  establish  the  electrical 
models  from  the  physical  theories  or  extract  the  circuit-level  netlist  of  the  substrate 
from  the  actual  layout,  require  substrate  model  tuning  or  the  completed  layout  to 
extract  the  design  information.  In  order  to  expedite  the  substrate  characterization  in 
the  early  design  stage,  the  proposed  method  in  Chapter  2 makes  use  of  separate 
measurement  vehicles,  which  can  be  done  simultaneously  with  main  design  work. 
With  the  additional  on-chip  signal  detection  circuit  to  pick  up  the  concerned  signal, 
the  embedded  signal  source  can  be  used  to  drive  the  specific  part  of  substrate  with 
various  structures,  such  as  various  distance  between  the  signal  source  and  detection 
circuits,  different  number  of  substrate  contacts,  or  several  protective  structures 
(guard  rings  or  deep  trench  protection). 

There  is  one  more  programmable  signal  source  requirement  not  to  be 
covered  in  this  dissertation,  which  is  signal  amplitude  tunability.  Because  many 
practical  systems  under  test  are  not  ideal  linear,  a single  amplitude  of  signal  source 
may  not  be  enough  to  investigate  the  system  functionality  within  the  full  range  of 
signal  magnitude,  even  though  the  frequency  tunability  is  provided.  To  get  the 
tunable  signal  amplitude  from  an  embedded  source,  gain-controllable  amplifier  is 
required  in  the  final  stage.  This  extra  circuit  may  cause  signal  quality  loss,  unless  it 
has  good  linearity. 

6.2  Embedded  Signal  Source  Generation  Using  Memory-Based  Periodic  Bitstream 

The  signal  source  generation  circuits  using  memory-based  periodic 
bitstreams  is  based  on  the  conventional  DDFS  and  the  delta-sigma  periodic  bitstream 
method.  By  taking  the  minimum  number  of  usable  frequency  bins  in  the  frequency 
spectrum,  the  frequency  bandwidth  and  the  bitstream  length  can  be  drastically  saved, 
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enabling  shortened  bitstreams  to  be  programmed  in  an  embedded  memory  elements. 
Also,  the  limited  number  of  usable  frequency  bins  helps  increase  the  signal  frequency 
of  generated  sources  with  the  same  system  clock. 

In  Chapter  3,  the  basic  idea  and  the  overall  architecture  of  the  proposed 
memory-based  periodic  bitstream  method  for  embedded  signal  source  generation 
were  presented,  including  software  work  using  MATLAB  to  choose  the  delta-sigma 
modulated  codes  to  be  programmed.  This  chapter  describes  design  variables  to  affect 
the  generated  signal  quality,  the  design  optimization  flow  of  MATLAB  behavioral 
model,  and  comb  filtering  idea  to  suppress  the  unnecessary  harmonic  components 
present  in  the  digitally  generated  periodic  bitstream. 

In  Chapter  4,  circuit  implementation  of  the  proposed  memory-based 
bitstream  method  using  0.25um  IBM6HP  SiGe  process  was  described.  It  consists  of 
a digital  signal  processing  part  and  an  analog  filtering  part.  The  digital  signal 
processing  blocks  include  a ROM,  a shift  register,  a variable  clock  generator,  and  a 
1 -bit  comb  filtering  circuit.  For  post-processing  to  shape  out  the  high  frequency  noise 
in  the  digital  bitstream,  rail-to-rail  digital  signal  scaling  circuit  and  a 4th-order 
tunable  GmC  low-pass  filter  are  added  to  the  final  stage.  According  to  the  Cadence 
SpectreS  simulation  results,  Rev. 2 design  shows  45dB  and  40dB  SFDR  of  single- 
tone and  dual-tone  signal  generation,  respectively.  As  a result  of  package  and  wafer 
sample  measurements,  it  is  turned  out  that  the  digital  circuit  blocks  are  working  as 
expected  with  more  timing  delay  than  in  circuit  simulation,  which  is  resulted  from 
the  routing  metal  capacitance.  However,  the  generated  signal  source  could  not  be 
measured.  After  careful  measurement  and  simulation  including  board-level  parasitic 
elements,  a couple  of  problems  were  found. 


162 


The  large  digital  output  buffer  and  clock  driver  circuitry  generated  serious 
spike  noise  at  the  time  of  digital  logic  transition.  This  catastrophic  noise  has  an 
global  effect  on  the  whole  circuit  itself.  Noise-sensitive  circuit  blocks,  such  as  signal 
scaling  circuit  for  digital-to-analog  interface  and  GmC  filter,  are  easily  disturbed  by 
this  spike  noise.  Large  capacitance  between  the  digital  power  and  ground  nodes  is 
required  to  mitigate  the  glitch  noise.  Also,  an  efficient  analog  circuit  protection 
scheme  against  the  digital  noise  via  the  substrate  should  be  built  for  noise-sensitive 
mixed-signal  designs.  A triple-well  process  would  be  a good  candidate  for  high 
performance  mixed-signal  IC  implementation  even  though  it  costs  much  more  than  a 
conventional  process. 

The  parasitic  effects  from  bondwires,  package  leadframes,  or  measuring 
probes  also  make  huge  difference  in  circuit  functionality.  The  parasitic  elements, 
such  as  inductance  and  resistance,  bring  about  a critical  power  supply  and  ground 
bouncing  problem,  which  is  a fatal  threat  to  many  analog  circuits.  The  full  chip 
simulation  including  these  parasitic  elements  shows  that  on-chip  bypass  capacitors 
are  helpful  to  reduction  of  ground  bounce  problem.  Careful  attention  should  be  paid 
to  the  location  of  the  bypass  capacitor.  Since  there  is  a parasitic  inductance  and 
resistance  between  the  I/O  pads  and  the  external  package  pins,  the  bypass  capacitor 
should  be  located  inside  the  IC.  Externally  connected  bypass  devices  may  not  be 
enough  to  remove  the  power  supply  bouncing.  Especially,  on-chip  bypass  capacitors 
need  to  be  located  close  to  the  analog  circuit  blocks  to  ensure  the  clean  power  supply 
and  ground  lines. 

To  improve  the  functional  reliability  of  the  designed  signal  source 
generation  circuit,  more  robust  analog  circuit  design  is  required.  The  PSRR  of  the 
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GmC  filter  is  very  important  design  factor,  because  the  power  and  ground  lines  of 
analog  circuit  blocks  may  be  easily  spoiled  by  several  reasons;  digital  circuit 
switching  noise  through  the  substrate,  ground  bouncing  noise  at  power  and  ground 
lines  due  to  unexpected  bondwire  or  probe  parasitic  model,  and  other  environmental 
hazards  from  practical  measurement  equipment.  The  internal  signal  level  scaling 
circuit  design  for  rail-to-rail  digital  signals  is  also  one  of  the  critical  design  factors, 
because  it  is  directly  exposed  to  high-speed  full-swing  digital  signals.  This  signal 
level  scaling  is  related  to  the  linearity  characteristic  of  the  subsequent  analog  filter 
for  noise  shaping.  Too  large  signals  for  the  inputs  of  analog  filter  cause  signal 
distortion  from  the  limited  linearity  performance  of  the  analog  filter,  and  too  small 
analog  input  signals  are  also  vulnerable  to  the  interference  from  the  digital  circuit 
blocks.  Wider  linearity  performance  of  the  analog  filter  block  enables  the  DA 
interface  circuit  to  work  on  a larger  signal  level.  A poor  measurement  environment 
could  ruin  the  sensitive  analog  signals,  even  though  they  are  properly  generated  from 
the  ICs.  Well-defined  power  supply  sources  and  high-quality  test  board  with  good 
ground  planes  are  required  for  reliable  measurement. 

There  could  be  additional  design  options  for  simplifying  the  design 
specification.  If  the  generated  signal  source  frequency  is  less  critical  than  the  signal 
quality,  we  may  have  some  redundant  frequency  bins  in  the  frequency  spectrum,  as 
shown  in  Fig.  6.1.  From  these  forbidden  frequency  components,  a wider  transition 
band  for  noise-shaping  filter  can  be  defined.  Also,  depending  on  the  frequency  tuning 
range,  passive  filtering  may  be  used  to  lessen  the  possible  design  overhead  associated 
with  the  noise-sensitive  active  filter. 
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Figure  6.1  Another  design  option  to  lessen  the  filter  design  overhead,  when  the 
generated  signal  source  frequency  is  less  critical  than  the  signal 
quality.  The  forbidden  frequency  bins  are  not  usable,  but  provide  a 
wider  transition  band  for  the  noise-shaping  filter,  which  makes  it 
possible  to  have  lower  order  filtering  or  passive  element  filtering. 

6.3  Embedded  Signal  Source  Generation  Using  LFSR  With  FIR  Filter 

In  Chapter  5,  design  difficulty  of  embedded  signal  source  generation 
associated  with  the  memory-based  periodic  bitstream  method  was  discussed;  that  is 
the  tight  filtering  specification  to  remove  the  unwanted  harmonics.  In  order  to  relax 
the  filter  design  requirements,  an  LFSR  approach  using  FIR  filtering  was  introduced, 
and  several  FIRs  were  examined  to  obtain  better  signal  quality.  The  intrinsic 
simplicity  of  this  approach  makes  the  chip  area  small,  and  the  noise-shaping  filter 
tunability  is  not  required. 

The  designed  source  generation  circuit  was  fabricated  using  0.25um  IBM 
6HP  SiGe  process,  and  the  fabricated  samples  were  measured  using  both  package  and 
wafer  samples.  Because  the  FIR1  design  has  harmonic  cancellation  zeros  at  every 
harmonic  locations  (excluding  the  fundamental  component),  its  signal  quality  was 
expected  to  be  better  than  the  FIRO  signal  source,  as  the  simulation  results  show. 
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However,  the  measurement  results  show  about  40dB  (FIRO)  and  30dB  SFDR  (FIR1) 
for  single-tone  source  generation,  though  this  measured  signal  quality  varies  with  the 
generated  frequency.  This  discrepancy  may  be  explained  by  the  structural  difference 
between  FIRO  and  FIR1  designs,  such  as  different  number  of  FIR  filter  coefficients, 
different  coefficient  value  range,  different  size  of  digital  signal  processing  parts.  The 
measured  dual-tone  signal  source  shows  25  to  30dB  SFDR.  (The  best  working 
sample  shows  up  to  42dB  SFDR,  but  it  is  not  working  in  the  expected  range  of  control 
input  voltage.) 

The  key  design  issues  of  the  LFSR  method  for  embedded  signal  source 
generation  can  be  summarized  as  follows. 

The  realization  of  a weighted  summation  circuit  to  implement  the  required 
FIR  filter  is  one  of  the  most  critical  parts,  affecting  the  generated  signal  quality 
directly.  Since  the  unwanted  harmonic  components  present  in  the  square-wave 
generated  by  using  conventional  digital  logic  blocks  should  be  eliminated  by  this 
weighted  summation  circuit,  any  device  mismatch  or  process  variation  occurred 
during  physical  circuit  implementation  may  cause  severe  signal  quality  loss  with 
excessively  large  harmonic  components  left. 

It  is  also  important  to  obtain  a good  square-wave  source.  If  the  ideal  FIR 
filter  is  given,  the  undesired  harmonic  terms  could  be  removed  by  the  FIR  filter. 
However,  a trivial  location  shift  of  system  zeros  in  the  FIR  transfer  function  may 
allow  excessively  large  gain  at  the  desired  position.  In  order  to  achieve  a more 
reliable  signal  source,  good  quality  square-waves  are  required  to  be  available  as  input 
to  the  weighted  summation  circuits.  Since  the  ideal  square-wave  has  only  odd 
harmonic  components,  an  unexpected  system  zero  shift  of  the  FIR  filter  off  the  even 
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harmonic  positions  does  not  produce  an  unwanted  harmonic  frequency  at  the  even 
harmonic  locations. 

The  well-designed  system  clock  and  its  distribution  circuitry  are 
additional  critical  factors,  especially  in  the  case  of  dual-tone  or  multi-tone  source 
generation.  The  embedded  multi-tone  signal  source  generation  using  the  LFSR 
method  must  have  multiple  LFSR  chains,  because  each  LFSR  chain  makes  the 
corresponding  single-tone  source.  With  the  LFSR  chains  designed  assuming 
identical  system  clock  sources,  a reliable  system  clock  and  its  skew-free  distribution 
are  required  for  pure  signal  generation.  For  a high  frequency  signal  generation 
system,  this  clocking  system  design  may  become  more  important. 

Finally,  the  physical  design  is  also  very  important  to  implement  reliable 
and  robust  circuits  against  statistical  and  environmental  hazards,  which  includes 
careful  floorplan  of  clock  distribution  and  LFSR  blocks  for  balanced  routing 
capacitance  and  common-centroid  layout  scheme  for  implementation  of  sensitive 
devices. 

6.4  Conclusions 

This  dissertation  has  presented  embedded  signal  source  generation 
methods  for  mixed-signal  self-test.  First,  the  memory-based  periodic  bitstream 
method  uses  software-based  oversampling  delta-sigma  modulation  and  monolithic 
hardware  for  demodulation  to  obtain  the  analog  signal  sources.  Second,  the  LFSR 
method  with  FIR  filtering  uses  the  conventional  shift  chain  registers  and  the  resistor 
arrays  to  execute  the  weighted  summation  for  FIR  filtering.  The  key  design  issues  of 
each  design  method  have  been  discussed  for  the  improvement  of  future  design  work 
and  applications.  Since  both  the  memory-based  periodic  bitstream  method  and  the 
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LFSR  method  using  FIR  filtering  are  based  on  the  digital  circuit  techniques, 
excluding  the  noise-shaping  filter,  these  circuits  are  readily  transferable  to  other 
technologies  or  scalable  to  more  advanced  short-channel  processes.  As  a specific 
system  application  example,  the  substrate  characterization  scheme  has  been 
proposed  using  the  designed  embedded  signal  source  generation  circuit. 
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